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Abstract 
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1} ' We introduce a new invariant of bipartite chord diagrams and use it to construct the 

m : first examples of groups with Dehn function n 2 logn. Some of these groups have undecidable 

' conjugacy problem. Our groups are multiple HNN extensions of free groups. We show that 

I . n 2 log n is the smallest Dehn function of a multiple HNN extension of a free group with 

undecidable conjugacy problem. 

Contents 

1 Introduction 2 

lMj! 2 Why n 2 logn? 3 

3 S- machines 9 

t^j- \ 3.1 S'-machines as HNN extensions of free groups 9 

!>■ ■ 3.2 Bands 10 

3.3 Trapezia 12 

7-H I 3.4 Admissible words and computations 12 

3.5 A slight modification of the S'-machine from [11] 15 

3.6 The adding S-machine 15 

\ 3.7 A composition of S and the adding machine 22 

3.8 Properties of the S'-machine S o Z 23 

4 Properties of the group S o Z 26 
r 1 "! ■ 4.1 A modified length function on 6* o Z 27 

4.2 Combs 28 

d; 

5 Dispersion of bipartite chord diagrams 29 

6 The upper bound of the Dehn function 33 

7 The end of the proof 39 

References 40 

*Both authors were supported in part by the NSF grants DMS 0245600 and DMS 0455881. In addition, 
the research of the first author was supported in part by the Russian Fund for Basic Research 05-01-00895, the 
research of the second author was supported in part by the NSF grant DMS 9978802 and the US-Israeli BSF 
grant 1999298. 



1 



1 Introduction 



Recall that the Dehn function of a finite presentation (X \ R) of a group G is the smallest function 
f(n) such that any word of length at most n in X that represents the identity of G is freely 
equal to a product of at most f(n) conjugates of elements of R. The Dehn functions f±, ji of any 
two finite presentations of the same group G are equivalent, that is f2(n) < Cfi(Cn) + Cn + C, 
fi(n) < CfiiCn) + Cn + C for some constant C. As usual, we do not distinguish equivalent 
functions. 

The purpose of this paper is to prove the following statement. 

Theorem 1.1. There exist finitely presented multiple HNN extensions of free groups with Dehn 
function n 2 logn. Some of these groups have undecidable conjugacy problem. Conversely, if 
d(n) is the Dehn function of a multiple HNN extension of a free group and lim^^ n = 
then the group has decidable conjugacy problem. Here lim c stands for the constructive limit (see 
Definition 2.1 below). 

Remark 1.2. For those unfamiliar with the definition of constructive limit, it is the same as 
the ordinary (e, A r )-definition, only N must recursively depend on e. Theorem 1.1 implies that 
a multiple HNN extension of a free group whose Dehn function does not exceed, say, n 2 \J\ogn 
or n2 i g log'iog" iogn ' mus t have decidable conjugacy problem. 1 

In particular, this theorem gives the first example of a Dehn function between n 2 and n 4 not 
of the form n a , a £ IR (see [2], [3]). The set of Dehn functions > n 4 is known to be large, and 
contains functions of the form n a , n a \ogn, n Q loglogn for any rational a > 4 (and in fact any 
relatively fast computable a > 4), and much more complicated functions. An almost complete 
description of all Dehn functions > n 4 has been found in [11]. Unfortunately, the methods from 
[11] do not give any information about Dehn functions between n 2 and n 4 , and methods of [2] 
and [3] give Dehn functions only of the form n a . Thus the picture of the class of small Dehn 
functions is incomplete. 

Recall also that it is still unknown if there exists a finitely presented group with Dehn function 
n 2 and undecidable conjugacy problem. We believe that such groups do not exist and n 2 logn 
is the lowest Dehn function of a group with undecidable conjugacy problem in a large class of 
groups than the HNN extensions of free groups (see Section 2). Groups with subquadratic Dehn 
functions are hyperbolic [4], [6], and so they have solvable conjugacy problem [4]. 

Our groups belong to the class introduced earlier in [11] by the second author. These are 
the hub- free realizations of S'-machines (in [11] these groups were denoted by G' N (S)). It follows 
from [11, Lemma 8.1] that the Dehn function of any G' N (S) does not exceed n 3 . E. Rips and the 
second author conjectured that some of these groups have Dehn functions n 2 log n and showed 
how such groups could be constructed. Unfortunately existing methods of finding upper bounds 
of Dehn functions did not give precise upper bounds of Dehn function of G' N (S). The reason 
is that G' N (S) does not have distorted cyclic subgroups to apply methods from [2], [3], and van 
Kampen diagrams over these groups do not have hyperbolic structure which allows one to apply 
surgeries from [11] or [1]. 

In this paper, we use a new method of finding the upper bounds of Dehn functions introduced 
by the first author. The method is based on a new invariant (dispersion) of bipartite chord 
diagrams naturally associated with van Kampen diagrams over the presentations of our groups. 
These invariants resemble the invariants of cord diagrams studied in [10, 9] by Polyak and Viro 

1 The authors are able to show that one cannot replace constructive limits by the ordinary limits in Theorem 
1.1. The proof is technically more difficult than Theorem 1.1 and is left out of this paper. 
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in relation to problems of Arnold, computing Vassiliev invariants of knots, etc. As far as we 
know, this is the first use of such invariants in geometric group theory, but we are convinced that 
similar invariants will be applied to other problems including those which are far from Dehn 
functions. The main idea is that these invariants measure the complexity of a diagram, and 
allow us to perform surgeries decreasing the complexity. 

The paper is constructed as follows. In Section 2, we give a "quasi-proof of the fact that 
any finitely presented group with undecidable conjugacy problem has Dehn function at least 
n 2 log n, and a complete proof of this statement for multiple HNN extensions of free groups. 

In Section 3, we start by introducing general properties of 5-machines viewed as groups 
(multiple HNN-extensions of free groups). Thus we identify an S'-machine and the corresponding 
group. In particular, we introduce the standard notions of bands and trapezia from [11]. 

Then we show how to slow down any S-machine S so that the space function of the new 
S-machine SoZ becomes equivalent to the logarithm of the time function. Later, in Section 6, 
these properties will translate into the upper bound n 2 logn of the Dehn function of S o Z . 

Several basic properties of the group SoZ are proved in Section 4. In van Kampen diagrams 
over an S'-machine, there are two types of bands, Q-bands and 6*-bands, that start and end on 
the boundary of the diagram. These bands form a bipartite chord diagram (BCD) since bands 
of the same type do not intersect and a band of one type intersects a band of another type at 
most once. 

In Section 5, we introduce a new invariant of BCDs, the dispersion, and prove that the 
dispersion of any BCD is bounded from above by a quadratic polynomial in the number of 
chords of one of the types. In Section 6, we prove that the area of a van Kampen diagram A 
over SoZ with |<9A| < n does not exceed C(n 2 logn + V) where V is the dispersion of the 
corresponding BCD. 

This gives an upper bound of n 2 logn for the Dehn function of S o Z. In Section 7, we give 
the similar lower bound of the Dehn function and show that SoZ has undecidable conjugacy 
problem provided S has undecidable halting problem. This completes the proof of Theorem 1.1. 
Finally we show how to generalize our construction to obtain groups with other unusual Dehn 
functions between n 2 and n 3 . 

Acknowledgement. The authors are grateful to the referee for many helpful remarks. 

2 Why n 2 log n? 

In this section, we shall give a "quasi-proof of the following conjecture. Then we show that the 
conjecture is true for multiple HNN extensions of free groups. 

We shall need the well known constructive version of a limit of a sequence of numbers. In 
fact we are going to use that definition only in the case when the limit is 0. 

Definition 2.1. Let g: N — ► R be a function. We say that the constructive limit of g(n) as 
n — > oo is if for every integer A > there exists N = N(A) such that for every n > N, 
\f(n)\ < 1/A, and the function N(A) is recursive. In that case we shall write lim^^^n) = 0. 
It is easy to see that lim^^ g(n) = if and only if there exists an increasing recursive function 
f(n) such that g(k) < ^ for every k > f{n). 

Quasi-Theorem 2.2. Let d{n) be the Dehn function of a finite group presentation P. Suppose 
that lim^^ -4 j"j = 0. Then P has decidable conjugacy problem. 
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Proof. We shall need the following Lemma. We call a van Kampen or annular diagram over a 
group presentation minimal if it has minimal area among all diagrams over that presentation 
with the same labels of the contour (contours). 

Lemma 2.3. Let A be a minimal annular diagram with contours p,p' over a finite group presen- 
tation P. Let x be a shortest path connecting p and p' . Then the area of A is at least C\x\ log \x\ 
for some constant C depending on P. 

Proof. Consider the following construction. Let po = P (considered as a cyclic path) be the inner 
contour of the diagram A. Suppose that we have constructed a cyclic path pi surrounding the 
hole of the diagram in A such that pi does not have common vertices with p' . Let Ki be the 
annulus bounded by po and pi. Let Mj + i be the set of cells of A outside Ki that have common 
vertices with pi. Then let Ki + \ be the minimal annular subdiagram of A with simple contours 
that contains Ki and all cells from Mj+i. Let pi + \ be the outer contour of Ki (the inner contour 
of Ki is p = po). 

It follows that every edge of the path pi + \ belongs to the contour of one of the cells of Mj+i. 
Hence every vertex of pi + \ can be connected with a vertex of pi by a path such that 

(0) the length of the path is bounded by a constant, 

(1) it can be connected with po by a path of length at most 2 0(i) and 

(2) the number of cells in Mj + i is at least 0(|pj + i|). 

From (1), it follows that the number of subdiagrams Ki is 0(|x|). Furthermore, more than a 
half of the paths pi have length at most log c \x\ where c is, say, four times the number of letters 
in the alphabet of the presentation P. Indeed, otherwise we would have two paths pi and pj, 
i j with the same labels, and we could remove the annular subdiagram between pi and pj 
reducing the area of A (that would contradict the minimality of A). 

From (2), it follows that at least half of the subsets Mj contain at least 0(log \x\) cells each. 
Since these sets do not intersect, the number of cells in A is at least 0(\x\ log |x|). □ 

Now the "quasi-proof of the Quasi-Theorem 2.2 proceeds as follows. Suppose that P is a 
finite presentation with undecidable conjugacy problem. Suppose that the constructive limit of 
^rj~ n is 0. Then, in particular, d(n) is bounded from above by a recursive function, and P has 
solvable word problem. 

Note that if an annular diagram A with contour labels u and v has a simple path x with label 
t connecting the contours, then we can cut A along x and obtain a disc van Kampen diagram 
with boundary label t^ut^v -1 . So if |t| is recursively bounded in terms of \u\ and \v\ (for every 
u and v that are conjugate modulo P) then the conjugacy of u and v can be algorithmically 
verified. 

Pick an increasing recursive function f(n) with ^ < ^ f° r every k > f(n) where C 

is the constant from Lemma 2.3 (as in Definition 2.1). Since the conjugacy problem for P is 
undecidable, there exists a minimal annular diagram A with contours p, p' such that any path 
in A connecting p and p' has length at least f(\p\ + \p'\). Let n = \p\ + \p'\. Let x be a shortest 
path connecting p and p' . Thus 

M > f(n), (2.1) 

and so 

d(3N) <i. (2.2) 



C|x| 2 log|x| n 

2 We use the Computer Science "big-O" notation assuming that f(n) = 0(g(n)) if ^p(n) < /(n) < Cg(n) for 
some positive constant C. 
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Since x is a shortest path connecting p and p', x is simple. Let us cut A along x and obtain 
a disc diagram L with boundary label zuz~ 1 v~ 1 where z is the label of x ±l . By Lemma 2.3, the 
area of T is at least C\x\ log \x\. 

Now we can take an integer m between ^ — 1 and We attach m copies of T consecutively 
to each other along the sides labeled by z to get a van Kampen diagram II with boundary 
label zu m z~ 1 v~ m . Notice that LT is reduced because it covers A with multiplicity m (after 
identification of the two z-sides of its boundary): for any 2-cell subdiagram £ of II where the 
cells share an edge, the annular diagram A contains a copy of X, so £ cannot be reducible since 
A is minimal. The perimeter r of II is between 2\x\ and 3|x|, and the area is m times the area 
of A. So, by Lemma 2.3, the area of n is at least . By (2.2), we can deduce that 

the area of LT is bigger than d(r). This contradicts the definition of Dehn function of a group 
presentation. □ 

Remark 2.4. The only gap in the preceding argument is contained in the last phrase. Even 
though IT is reduced, we cannot guarantee that IT has minimal area among all diagrams with the 
same boundary label, and, in principal, the area of a minimal diagram with this boundary label 
may be even quadratic in terms of the perimeter r. Still we do not know any groups for which 
this proof does not work. Note that we do not need LT to be minimal: only that the minimal 
diagram with the same boundary label does not have too few cells compared to IT. Also we have 
freedom of choosing u, v, A and x. We do not need x to be a minimal length path connecting 
the boundary components of A. We only need that the area of A exceeds 0(\x\ log \x\) divided 
by a recursive function in n (depending only on the presentation). In addition, the number m 
should only be 0( iJ+Li )■ Thus Conjecture 2.2 seems true for a very large class of groups and 
possibly for all groups. 

Let P be the standard presentation of a multiple HNN extension of a free group Fx with sta- 
ble letters ti,...,tk and pairs of finitely generated associated subgroups A; L = (a^i, a^), -Bj = 
(bi i, biji) given by their free generating sets. So the defining relations of the presentation P 
are a* 1 = bi tS , i = 1, k, s = 1, ji- Here and below a* means tat^ 1 . 

As usual when one works with HNN extensions, i-bands play significant role (they are also 
called strips and corridors). We shall give a more general definition of bands in Section 3.2. 

For every letter a, an a-edge in a van Kampen diagram is an edge labeled by a ±l . A tj-band 
in a diagram over P is a sequence of cells containing tj-edges, such that every two consecutive 
cells share a tj-edge. It is well known [5] that in a reduced van Kampen diagram over P, there are 
no tj-annuli, i.e. the first and the last tj-edges of a tj-band cannot coincide. So every maximal 
ij-band in a diagram must connect two tj-edges on the contour of the diagram. In an annular 
diagram over P, every maximal tj-band either connects two edges belonging to the boundary 
or is an annulus surrounding the hole of the diagram. The contour of a tj-band has the form 
epf _1 q where e, / are tj-edges, and e and / do not have t-edges (these are the sides of the band). 

The following theorem is a part of Theorem 1.1. 

Theorem 2.5. Let d(n) be the Dehn function of P and lirn^^ n f\^ n = 0. Then P has 
decidable conjugacy problem. 

Proof. Suppose that the conjugacy problem is undecidable. We use the same notation as in the 
quasi-proof above. 

With every reduced van Kampen diagram ^ over P, one can associate a chord diagram C(^) 
where the disc is the diagram and chords are the t-bands (more precisely, their medians). 
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Lemma 2.6. Let ^ and Vl/' be two reduced diagrams with the same boundary, such that C(^f) = 
C ($!'). Then the areas of ^ and W are the same. 

Proof. Let u be the common boundary label of \I> and Vl/', and C = C(\I/) = C(*'). Then 
there exists a one-to-one correspondence T i— > T between the maximal t-bands in and 
A side of each maximal t-band T connects two vertices on d^. The label of the subpath of 
connecting these vertices is a subword of the label of d^f. Note that this subword is the 
same for the corresponding side of T'. Hence the labels £,£' of the corresponding sides of T 
and T are equal modulo P. Since both £ and £' are words from the base group of an HNN 
extension, they are equal in the base group as well. Since the base group is free, these labels are 
freely equal. Since a^j and bij freely generate subgroups Ai and Bi, the number of cells in T 
is the same as the number of cells in T (both numbers are equal to the length of the element 
£ in the corresponding subgroup A\ or Bi, and the bands T, T are reduced diagrams by our 
assumption). Since every cell in ^ and belongs to a i-band, the areas of \£ and Vl/' are the 
same. □ 

Recall that a pinch is a word of the form t{ut~ x or t~ 1 vti where u G Ai and v € Bi. Note 
that if the area of a van Kampen diagram over P is greater than 0, then its boundary label has 
at least two pinches since it is equal to 1 modulo P. 

Lemma 2.7. Suppose the boundary label I of ^ has only two pinches as a cyclic word. Then 
C(\I/) is uniquely determined by I. 

Proof. The boundary is a product of two paths pq~ x where the labels of p and q^ 1 do not 
have pinches. Hence every t-band connects a i-edge of p with a t-edge of q. Since t-bands do 
not intersect, C(\I/) is reconstructed uniquely. □ 

We say that a word W is cyclically minimal if none of the cyclic shifts of it has pinches. Note 
that if W is cyclically minimal then any power of W is cyclically minimal as well. 

Lemma 2.8. Suppose that a word W is cyclically minimal. Suppose also that a word U has no 
pinches. Then the word UWU~ l has at most one pinch. 

Proof. Indeed, if this word contains two pinches then W = W1W2W3, U = U-JJ2 where U2W1 is 
a pinch and H^L^ 1 is a pinch. But then W3W1 is a pinch, and W is not cyclically minimal. □ 

Lemma 2.8 immediately implies 

Lemma 2.9. Suppose that the words W± and W2 are cyclically minimal, and U does not have 
pinches. Then the word UWiU^W^ has at most two pinches (as a cyclic word). 

Now let us return to the proof of Theorem 2.5. Note that by Remark 2.4, we can do the 
following operations with A and x: 

• replace A by a minimal diagram Ai whose boundary labels are equal to the boundary labels 
of A modulo P and have lengths that are recursively bounded in terms of n = \u\ + \v\, 
and 

• replace x by a path x' connecting the boundary components such that |x'|/|x| is recursively 
bounded in terms of \u\ + If I. 
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In order to be able to replace (A,x) by (Ai,xi), one needs to replace the condition (2.1) by the 
condition |x| > / (fi(n)) f2(n) where f\ and fi are some fixed increasing recursive functions. 

Our goal is to choose A and the path x so that the diagram II is minimal. 

We may assume that the words u and v are cyclically minimal because we can replace all 
pinches in these (cyclic) words by words without t's (this can be done effectively since the word 
problem is solvable, and the lengths of u and v would increase only recursively). Note that a 
t-band cannot connect two edges on the same boundary component of A because otherwise u or 
v would contain pinches (as cyclic words). Thus there are two cases: (1) u and v do not contain 
t-letters, and (2) u contains t-letters (then v also contains t-letters). 

In the first case the maximal t-bands in A form annuli surrounding the hole, the outer side 
of one annulus is the inner side of the next one. In the second case, the maximal t-bands are 
radial, connecting the inner contour with the outer boundary component of A. 

Since IT is reduced, Lemmas 2.9, 2.6 and 2.7 tell us that we can claim that II is minimal 
provided we can ensure that the label z of x does not contain pinches. 

Case I. Suppose that u and v do not have t-letters. Let r be the number of t-annuli in A. 

Clearly no two sides of these t-annuli have the same labels (otherwise we could remove the 
subdiagram bounded by these two sides), which implies as in the proof of Lemma 2.3 that the 
area of A is at least 0(r log r). We can also assume that r > f(n) where f(n) is the recursive 
function from the quasi-proof. Indeed, if r is bounded by a recursive function for every A then 
the area of A is bounded by a recursive function too. 

Clearly there exists a path in A connecting p and p' and having length 0(r). So we can 
assume that x is that path (and not the shortest path connecting p and p' as in the quasi-proof). 

Suppose that the label z of x contains a pinch and x' is the corresponding subpath of x. 
Then the first and the last edges of x' are i,-edges belonging to two consecutive tj-annuli T and 
T in A. Since the label z' of x' is a pinch, there exists a diagram E consisting of one t,-band 
such that the label of 9E is z'z", and z" does not contain t-letters. Therefore we can cut A 
along x', and patch the resulting hole by gluing in a copy of E and a copy of the mirror image 
E' of E glued together along the part of the boundary labeled by z". The resulting annular 
diagram A' is not reduced. But instead of two tj-bands T and T', A' contains one i,-band T" 
whose set of cells is the union of the sets of sells in T, T , E and E'. The annulus T" does not 
surround the hole of A'. Hence it bounds a disc subdiagram <I> of A'. The boundary label of <I> 
does not contain t-letters. Hence the boundary label of $ must be equal to 1 in the free group. 
Hence can be replaced by a diagram without cells. The new diagram A" has fewer cells, a 
contradiction with the minimality of A. Thus z does not have pinches, and we are done. 

Case 2. Suppose that u and v have t-letters. 

The number s of maximal t-bands in A is bounded by min(|-u|, < n = \u\ + \v\. Let us 
cut A along a side q of a tj-band. Let T be the resulting diagram. We can assume that q is the 
shortest among the sides of the t-bands in A. We can also assume that \q\ is the smallest for all 
annular diagrams with the same boundary labels and the same area. This implies that there is 
no diagram with boundary label of the form zuz~ 1 v~ 1 whose area does not exceed the area of 
T and \z\ < \q\. 

The van Kampen diagram T has contour pq(p')~ l {q') 1 where the labels of p and q are (cyclic 
shifts of) u and v respectively, and q and q' have the same label z. We shall denote by g_ and 
q + the initial and terminal vertices of q. Two vertices V and V' in q and q' are called co-phase 
if their distances from g_ and q'_ (along q and q') respectively are the same. We say that V is 
higher (lower) than V' if the distance from V to q^ is bigger (smaller) that the distance from 
V to q'_. 

Let Ti, % be all maximal t-bands in T connecting p and p', ordered from q to q'. So q is 
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a side of 1±, q' is a side of T s . It is easy to prove that the length of each maximal t-band in T 
is at most (\q\ + n)exp(Cn) for some constant C. Indeed, the label Wi of a side of % is equal 
modulo P to the label of q multiplied on the left and on the right by two words of length at 
most \u\ + \v\. Therefore the length of Wi in the base group cannot exceed (\q\ + n) exp(C'n) for 
some constant C: when we reduce a pinch in a word, the length increases by a constant factor. 
Since the number of cells in % is 0(|u>j|), we obtain the desired inequality. 

Since \q\ >> \u\, the area of T (= the area of Aj) is at most 0(\q\ exp(C(|w| + \v\))) for some 
constant C. For every vertex V on q consider a shortest path p(V) connecting V with a vertex 
in q'. Note that \p(V)\ does not exceed a constant times s. If the vertex p{V)+ is co-phase with 
V then we say that p(V) is parallel to p. We can assume that p(V±) does not cross p(V2) for any 
V\ 7^ V2. Indeed, if p(Vi) = piP2,p(V2) = P1P2 and the end points of pi, p[ are the same, then 
the lengths of P2 and p' 2 are the same (otherwise the path p{V\) or p(V2) would not be shortest), 
and we can replace piV^) by p[p2- Thus we can talk about a subdiagram $(Vi, V2) bounded by 
p(Vi) and p(V 2 ). 

Suppose that there exist two vertices V\ and V2 such that p{V\) and p(V^) are parallel to p 
and the labels of the paths p(V\) and ^(^2) are the same. Then we can remove the subdiagram 
®{Vi, V2) of r. The resulting diagram V will have boundary label of the form zuz~ 1 v with z 
shorter than \x\, a contradiction. Hence the labels of all paths p(V) that are parallel to p are 
different. Hence the number ir of such paths is at most an exponent in C\u\ for some constant 
C. Therefore ir is small comparing to \q\. These paths cut V into at most e(n) conjugacy 
subdiagrams where e is a recursive function. One of these subdiagrams must have area bigger 
than C\q\/e(n) for some constant C. Thus we can deal with this "large" subdiagram instead of 
r. Hence without loss of generality, we shall assume that T does not have paths p(V) that are 
parallel to p except possibly for p and p' . 

Now let us number all vertices of q starting with q_: V\, V2, .... For each j = 1, 2, let l(j) 
be the distance from p{Vj) + to q'_. Since the paths p{Vj) do not intersect, and none of the paths 
p(Vj) are parallel to p except possibly for p(q~) and p(q+), either > j for all j or j > l(j) for 
all j and the inequalities are strict except, possibly, for Vj = q~ and Vj = q+- We can assume 
that the first possibility holds because otherwise we can turn V upside down switching u and v. 

Let us define a sequence of vertices on q as follows: Qi = V2 and for every j = 2,3,... 
let Qj = VJ(Q j _ 1 ). Note that Qj is co-phase with p(Qj-i)+- Let us define the path pj as 
the composition of the subpath Qj — Qj-i of q and p(Qj-\) is parallel to p. Then for some 
small enough constant c the number of paths pj of length < clog \q\ is at most \/\q\. Since 
the length of p(Qj-i) is recursively bounded in terms of |u|, we can assume that the number 
of j's such that the distance between Qj-i and Qj along q is smaller than clog|g| is at most 



y/\q\- Since each Qj is higher than Qj—i, the number of j's for which this distance is not 

smaller than clog |g| is at most |g|/c log \q\. Hence the total number of points Qj is at most 



Let go be the subpath of q connecting V\ and V2 (this path is simply an edge). Since p(Qj-i) 
and Qj are co-phase, images of paths qo and all p(Qj) in A is a path x connecting the boundary 
components of A. The length of x is at most 0(\q\/ log \q\) times a recursive function in n as 
required. Thus, by Lemmas 2.6-2.9, it remains only to show that the label z of x does not have 
pinches. 

For a contradiction, suppose that z contains a pinch z' and x' is a subpath of x whose label 
is z' . Then z' = t^u'tj 1 . The two t-bands intersecting x' are consecutive t-bands % and 7fc + i 
for some k in the annular diagram Aj (we consider k modulo s so T s+ \ = T±). Let us connect 
x'_ and x' + with vertices R, R' on the inner contour of Aj along the sides of 7^ and Tfc+i by 
paths qi, q2- The vertices R, R' can be connected by a subpath p of the boundary component 
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such that p contains the end edges of the i-bands 7^ and T^+i- If this quadrangle surrounds the 
hole of Aj, we can repeat the same construction using another boundary component of Aj. The 
resulting quadrangle won't surround the hole in that case. Since these cases are similar, we can 
assume that the initial quadrangle does not surround the hole. Then this quadrangle is a van 
Kampen diagram over P. Since the label of x' is a pinch, it is equal to a word without i-letters 
modulo P. Since the labels of q± and q% do not contain tdetters, the label of p is equal modulo 
P to a word without i-letters. Hence the (cyclic) word Ui contains a pinch, a contradiction. □ 

3 S'-machines 

3.1 .S-machines as HNN extensions of free groups 

Probably the easiest way to view an 5-machine S in the sense of [11] is to consider S as a group 
that is an HNN extension of a free group F(Q, Y) generated by two sets of letters: state letters 
Q = ufLiQi an d tape letters Y = L)^J[ 1 Yi where Qi are disjoint and non-empty. The sets Qi 
(resp. Yi) are called parts of Q (resp. Y). 

We shall follow the tradition of calling state letters q-letters and tape letters a-letters, even 
though we shall use k with indexes for state letters and y with indexes for tape letters. 

Instead of the set of stable letters we have a collection of iV-tuples of ^-letters or rules. 
The components of 9 are called brothers 9i,...,0n. In this paper, we always assume that all 
brothers are different. We set 9n+i = Oi, Yq = Yjy = 0. 

To every G 0, we associate two sequences of elements in F{Q U Y): B{6) = [U±, ...,Un], 
T{6) = [Vi, V N ], and a subset Y{9) = UYj(0) of Y, where Y0) CY*. 

The words Ui,Vi satisfy the following restriction: 

(*) For every i = 1, N, the words Ui and Vi have the form 

Ui = Vi-ikiUi, Vi = v'^k'ju'i 

where ki,k[ 6 Qi, Ui and u\ are words in the alphabet Y^ 1 , Vi-i and v' i _ l are words in the 
alphabet Y±\. 

The generating set X of S consists of all q-, a- and ^-letters. The relations are: 

Ui9 i+ i = 6iVi, i = l,...,s, 9ja = a8j 

for all a G Yj{6). The first type of relations will be called (q, 9)-relations, the second type - 
(a, 9)-relations. 

Sometimes we will denote the rule 9 by [U\ — ► V±, Un — > Viv]. This notation contains all 
the necessary information about the rule except for the sets Yi{9). In most cases it will be clear 
what these sets are. In the S'-machines used in this paper, the sets Yi(9) will be mostly equal 
to either Y, L or 0. By default Y0) = Y, L . 

In order to simplify the notation, we will use the notation VikiUi — > v'^u^ for a part of a rule 
when the corresponding Yi(9) is empty (a similar notation has been used in [7]). 

Every 5-rule 9 = [U\ — > V±, U s — > V s ] has an inverse = [Vi — > U\, V s — > ?7 S ]; we set 
Yi(# _1 ) = 11(0). We always divide the set of rules of an S'-machine into two disjoint parts, 
+ and 0~ such that for every 9 G + , G 0~ and for every G _ , 6* _1 G + . The rules 
from + (resp. 0~) are called positive (resp. negative). 
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Remark 3.1. 1. Every S'-machine is indeed an HNN-extension of the free group F(Y, Q) with 
finitely generated associated subgroups. The stable letters are 9\ for every 6 £ Q. We leave it 
as an exercise to find the associated subgroups. 

2. Notice that in [11], a slightly different notation for rules of S'-machines was used. Instead 
of, say, the rule 

[vokiUi — > v' k' 1 u' 1 ,vik2U2 — ► v^k^u^], (3-3) 

one would use the notation 

[vokiUiV\k2U2 — ► fQ^^f^A;^^]- (3-4) 
But two relations corresponding to the rule (3.3): 

6^ 1 VQkiUi = v'ok'iu'iO^ 1 

and 

9 2 ~ 1 vik2U2 = v'lk^u^O^ 1 
are Tietze equivalent to one relation corresponding to (3.4): 

9^ 1 vok\UiVik2U2 = v^k^u^v^k^u^O^ 1 

since 02 is expressible in terms the other generators. Therefore these notations are equivalent. 

3.2 Bands 

From now on, we shall only consider reduced van Kampen diagrams i.e. diagrams that do not 
contain cells that have a common edge and are mirror images of each other. Hence all van 
Kampen diagrams are assumed to be reduced. To study van Kampen diagrams over the group 
S we shall use bands and trapezia as in [11], [1], etc. 
Here we repeat necessary definitions from [11]. 

Definition 3.2. Let M be a subset of X. An M-band B is a sequence of cells 7Ti, ...,7r n in a 
van Kampen diagram such that 

• Each two consecutive cells 7Tj and 7Tj + i in this sequence have a common edge ej labeled by 
a letter from M. 

• Each cell 7Tj, i = l,...,n has exactly two M-edges, ej_i and e,- L (i.e. edges labeled by a 
letter from M). 

• If n = 0, then B is just an M-edge. 

The counterclockwise boundary of the subdiagram formed by the cells tt±, 7r n of B has the 
form e~ l q\fq2~ l where e = eo is an M-edge of tti, f = e n is an M-edge of Tr n . We call q\ the 
bottom of £> and 52 the top of 23, denoted bot(23) and top(£>). 

Consider lines Ifa, e«) and l(iri, ej_i) connecting a point inside the cell 7Tj with midpoints of 
the M-edges of 7Tj. The broken line formed by the lines Z(7Ti,e), Z , ) , Z(7Tj,ej_i), l(ir n , f) is 
called the median of the band 23. It connects the midpoints of e and / and lies inside the union 
of 7Tj. The M-edges e and / are called the stori and end edges of the band. If n = 0, then the 
median is the midpoint of e = /. 
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We say that an Mi -band and an M2-band cross if their medians cross. We say that a band 
is an annulus if its start and end edges coincide. In this case the median of the band is a simple 
closed curve. 

The subdiagram bounded by the median of an annulus in a disc diagram is called the inside 
diagram of this annulus. 

Let M\ and M2 be two disjoint sets of letters, let (tt, tti, . . . , 7r n , tt') be an Mi-band and let 
(it, 71, . . . , 7 m , tt') be an M 2 -band. Suppose that: 

• the medians of these bands intersect in two points A, B inside tt and tt' , and the parts of 
the medians between A and B form a simple closed curve, 

• on the boundary of tt and on the boundary of tt' the pairs of Mi -edges separate the pairs 
of M2 -edges, 

• the start and end edges of these bands are not contained in the region bounded by the 
medians of the bands. 

Then we say that these bands form an (Mi, M2)-annulus and the simple closed curve formed by 
the parts of medians of these bands is the median of this annulus. For every annulus we define 
the inside subdiagram of the annulus as the subdiagram bounded by the median of the annulus. 

We shall call an M-band maximal if it is not contained in any other M-band. 

As in [11], we can consider g-bands where M is one of the sets Qi, #-bands for every 6 6 0, 
and a-bands where M = {a} C Y. Every eel of a g-band is a g-cell by definition. A g-cell is also 
a (g, #)-cell (and also (6, g)-cell) since it corresponds to a (g, #)-relation. The convention is that 
a-bands do not contain g-cells, and so they consist of (a,#)-cells (= (#,a)-cells) only. 

The following lemma has been essentially proved in [11]. 

Lemma 3.3. A reduced van Kampen diagram A over S has no q-annuli, 6-annuli, (q,9)-annuli, 
a-annuli, (a,6)-annuli. 

Proof. We assume that A is a counterexample with minimal area. This means in particular that 
the boundary of A is the boundary component of an annulus A, where A has one of the types 
from the formulation of the lemma. 

(1) Suppose A is a g-annulus. Then it consists of (q, 6*)-cells. Hence there is a maximal 
#-band T in A, whose first cell LTi and the last cell II2 belong to A. Being members of the 
same (/-band A and #-band T, the cells IT and II2 cannot be neighbors in T (the diagram is 
reduced). Hence T and a part of A form a (6, g)-annulus whose area is smaller than that of A. 
This contradicts the choice of A. 

(2) Suppose A is a #-annulus. If it contains g-cells, then we come to a contradiction as in (1). 
Otherwise A has no g-cells since there is no counter-example of smaller area. The inner part 
of A has no 8-edges for the same reason. Hence F has no cells corresponding to the relations 
of the group S. So, on the one hand, the inner label of A is a cyclically reduced non-empty 
word in Y since A is a reduced diagram, and on the other hand, this word is freely equal to 1, 
a contradiction. 

(3) Suppose A is a (g, #)-annulus. Then the maximal g-band T of A cannot have more than 
two cells because otherwise A would contain a smaller counterexample as in (2). Hence the 
length of T is 2, and its cells are mirror copies of each other, a contradiction (we assumed that 
the diagrams are reduced). 

(4) Suppose A is an a-annulus. Then its boundary labels are words in ^-letter. This leads, 
as in (1), to a smaller (a, #)-annulus, a contradiction. 
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(5) Suppose A is a (0, a)-annulus and let T be the maximal a-band of it. It cannot have 
more than two (0, a)-cells because otherwise there would be a smaller (0, a)-annulus. Hence the 
length of T is 2, and its cells are mirror copies of each other, a contradiction. The lemma is 
proved. □ 

3.3 Trapezia 

If W word in an alphabet X, Y is another alphabet, and (j): X — > Y U {1} 

(where 1 is the empty word) is a map, then <f>(W) = <f>{xi)...(f>(x n ) is called the projection of 
W onto Y. We shall consider the projections of words from S onto G (all ^-letters map to the 
corresponding element of 0, all other letters map to 1), and the projection onto the alphabet 
{Qi, Q n } (every (/-letter maps to the corresponding Qi, other letters map to 1). 

Definition 3.4. The projection of the label of a side of a (/-band onto the alphabet is called 
the history of the band. The projection of the label of a side of a #-band onto the alphabet 
{Qi, Q n } is called the base of the band. Similarly we can define the history of a word and 
the base of a word. The base of a word W is denoted by base(VF). It will be convenient instead 
of letters Qi, Q n , in base words, to use representatives of these sets. For example, if k\ G Qi, 
k>2 € Q2, we shall say that the word kiak-z has base k\k2 instead of Q\Q2- 

Definition 3.5. Let A be a reduced van Kampen diagram which has the contour of the form 
Pi 1 QiP2Q2 1 where: 

(TRi) p\ and P2 are sides of (/-bands, 

(TR2) qi, q2 are maximal parts of the sides of 6*-bands such that <j>{q\), ^(52) start and end 
with q- letters, 

(TR3) for every #-band T in A, the labels of top(T) and bot(T) are reduced. 

Then A is called a trapezium. The path q\ is called the bottom, the path q2 is called the top 
of the trapezium, the paths p\ and P2 are called the left and right sides of the trapezium. The 
history of the (/-band whose side is p\ is called the history of the trapezium; the length of the 
history is called the height of the trapezium. The base of p\ is called the base of the trapezium. 

Remark 3.6. Property (TR3) is easy to achieve: by folding edges with the same labels having 
the same initial vertex, one can make the boundary label of any subdiagram in a van Kampen 
diagram reduced, see [11]. 

Remark 3.7. Notice that the top (bottom) side of a #-band T does not necessarily coincide 
with the top (bottom) side of the corresponding trapezium of height 1, and is obtained from 
top(T) (resp. bot(T)) by trimming a few first and last a-letters. We shall denote the trimmed 
top and bottom sides of T by ttop(T) and tbot(T). 

3.4 Admissible words and computations 

Using trapezia, one can now formally define admissible words and application of a rule to a 
word. 

Definition 3.8. Let A be a trapezium of height 1. Let be the element of Q ±l whose repre- 
sentative (one of the brothers) is written on a side of that trapezium. Then the word W written 
on the bottom of of A is called an admissible word for 0. The word written on the top of A is 
called the result of application of to W and is denoted by ■ W. Clearly, ■ W is uniquely 
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determined by 9 and W, and 9~ l • {9 • W) = W (hence the label of the top of the trapezium is an 
admissible word for 9~ 1 ). We call a word admissible if it is admissible for some 9 G 0. For every 
word / = /i/2-"/n i n @ we define / • W as /„ ■ (...(/2 ■ (/i ■ W))...). In particular, 1 • W = W for 
every word W. 

Recall that we assume that all components in every iV-tuple 9 are different. In order for a 
rule 9 to be applicable to a word W = q\U\q2---u n -\q n where qi are from Qf,\, itj is a word in 
y^ 1 , the (q, #)-relations involving % and Oj+i must share a ^-brother, and that ^-brother must 
commute with all letters of U{ that are not involved in the relations containing qi, Oj+i. This and 
the definition of admissible word immediately imply the following lemma. 

Lemma 3.9. Every admissible word of a rule 9 has the form qiuiq-2...u n -iq n where for every i 
from 1 to n there exists j(i) such that qi G Qfuy an d 

• If qi G Q j{ i) then Uj is a group word in Y j{ ^ and q i+1 G Qj(i)+\ U Qj^; 

• If Qi G then m is a group word in Yj^_± and q i+1 G Q j{i) U Q~L_ V 

By Lemma 3.3, any trapezium A of height h > 1 can be decomposed into #-bands 71, 
connecting the left and the right sides of the trapezium. The word written on the trimmed top 
side of one of the bands % is the same as the word written on the trimmed bottom side of i = 
l,...,h. Therefore with every trapezium A we can associate a sequence of words W\, W2, W/j+i 
and a sequence of rules 0\, #2, 9 h such that W2 = 9\ ■ Wi, W3 = 62 ■ W2, Wh+i = 9 h ■ Wh- 

This pair of sequences will be called a computation of S connecting W\ and Wh+i- We shall 
denote the computation by 

Wi -+ ei >e h ^ W h ^e h W h+1 , 

or simply 

Wi > W fe W ft+ i. 

The number /i is called the length of the computation. Since we consider only reduced 
diagrams, the history of every trapezium is a reduced word. That word t = 9\92--9h is called 
the history of computation. The area of the trapezium is called the area of computation. The 
length of the longest word Wi in this computation is called its width. It is also convenient to 
consider empty computations consisting of one word W. The history of an empty computation 
is the empty word, the start and end words of this computation are equal to W. 

Notice that W\f = fWh+i for some words /, /' in 9- and a-letters whose projections onto 
G are equal to the history word t. It is easy to see that |/| = 0(\t\). 

Remark 3.10. One can easily see that the computation W — ► W\ — > ... looks like a computation 
of a Turing machine with many heads, the (/-letters. Heads can move left and right, change their 
states, and change a-letters (which play the role of tape letters) around them. 

As for a usual Turing machine, we choose a distinguished stop word W from F(Q,Y). 
We say that a word W G F(Q, Y) is accepted if there exists a computation connecting this 
word and W . 

The following lemma immediately follows from the definition of a computation and Lemma 

3.3. 

Lemma 3.11. Let IS. be a trapezium with bottom label W and top label W . Then there is a 
unique computation W — > ... — > W' whose history is the history of A. 
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We shall also need the following lemma. 

Lemma 3.12. Let Wq — ^ W\ — ► ... — >g t Wf 6e a computation of an S -machine S. Suppose 
that Wi = Wj and Qi / f or some 1 < i < j < t. Then 

Wo Wi... Wi ^e j+1 W t (3.5) 

is again a computation of S. 

Proof. This surgery amounts to removing (9-bands number i + 1, j (counted from the bottom) 
in the trapezium A corresponding to the initial computation. Let us show that the new diagram 
A' is reduced. Indeed, pairs of cells in the same 6*-band in A' cannot cancel because the same 
pair of cells existed in A. The cells from different #-bands in A' cannot cancel because either 
the same pair of cells exists in A or one of these cells is in the i-th #-band of A, and the other 
one is in the j + 1-st #-band of A, and these cells cannot cancel in A' because 0j / ®J+i by our 
assumption. The conditions (TRi), (TR2), (TR3) obviously hold for A'. So A' is a trapezium, 
and (3.5) is a computation. □ 

Lemma 3.13. Assume that two admissible words W and W' are conjugate in the group S. 
Then there exists a computation W — ► ... — > W" of S where W" is a cyclic conjugate of'W' that 
starts with a q-letter from the same Qi as the first letter ofW. 

Proof. By the van Kampen - Schupp lemma, there is a reduced annular diagram A whose 
boundary components p and p' are clockwise labeled by W and W'. It follows from Lemma 3.3 
that, for some h > 0, A is a union of concentric #-annuli T±, . . . ,7^, where T\ is attached to 
p, T2 has a common boundary component with Ti,..., Th is attached to p'. Let k be the first 
letter of W. Then a maximal fc-band connects p and p' . We may assume that h > 0. Cutting 
A along a side of this A;-band, we get a trapezium. By Lemma 3.11, there is a computation of 
S connecting W with a word W" that is a cyclic shift of W'. Since all g-edges of a (/-band have 
labels from the same set Qi, the first letters of W and W' are from the same Qi. □ 

Remark 3.14. Suppose that the base of a trapezium A starts with a Qi-letter and ends with 
a QAr-letter. Then the labels of the sides of the trapezium are the same and do not contain 
a- letters: it follows from the agreement that Qn+i = Qi, and that in every part of an S'-rule of 
the form vok\U\ — ► v^kiu^ (resp. vjy-ikNUN — ► v' N _ 1 k' N u' N ) the words vo,v' (resp. un, u' n ) are 
empty since they are words over empty alphabets Y$ and Yjy. 

Lemma 3.15. Suppose that the stop word W starts with a letter from Qi and ends with a 
letter from Qn and has only one Q\-letter. Suppose that the language of accepted words is not 
recursive. Then the set of words that are conjugates of W in S is not recursive. Hence S has 
undecidable conjugacy problem. 

Proof. Since the first letter in W is from Q\ and the last letter is from Qat, the left and right 
sides of any trapezium with the bottom label W are the same by Remark 3.14. Hence if W is 
accepted, it is a conjugate of W. Conversely, if W starts with a Qi-letter and is a conjugate 
of W, then by Lemma 3.13, there exists a computation W — ► ... — > W' where W' is a cyclic 
conjugate of W starting with a Qi-letter. Since W contains only one Qi-letter, W' = W, so W 
is accepted. □ 
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3.5 A slight modification of the S'-machine from [11] 

Let £ be a recursively enumerable language over an alphabet X. Then by [11, Proposition 4.1], 
there exists an S'-machine S recognizing C in the sense of Lemma 3.16 below. 

For that S'-machine, N = 17, so the set of (/-letters is partitioned into 17 subsets which will 
be more convenient to denote by K±, K^. The elements of Ki will be denoted by ki(j). The 
stop word W is ki(0)k 2 (0)--.kN(0). The set X is contained in Y\, and for every positive word u 
in X we denote a(u) = /ci(l)u/c2(l)--.fcjv(l). The following lemma is proved in [11]. 

Lemma 3.16. [11, Proposition 4-1] (a) For every u 6 C there exists a computation a(u) —*q 1 
■ — > W consisting of positive words. 

(b) For every word u C over X the word <j{u) is not accepted. 

Let us modify S a little. We add two new sets of state letters K^+i = {fcjv+i}, K^+2 = 

{fe/v_l_2}, and a new state letter kj in every Kj and a new set Y/v+i = {a} of tape letters. We 

- i — 

also add two rules 770, rji of the same form [kw+i —> kjy+i, kjy+2 — ► ctkN+2, kj — ► fcj, j = 1, ...,N\. 
Note that now the new number of parts of Q is N + 2, so we have to count modulo N + 2 and 
instead of the assumption that 6n+i = Oi we have to assume that 0n+3 = 
Notice that the new machine admits computations of the form 

ki...k N k N+ ik N+2 -^q ... ^ T;o ki---kNkN+ia m k N+2 ^-1 ... ^-1 fci...fcjvfeiv+iA;jv+2- (3.6) 

We keep notation 5 for the new S-machine and we keep notation iV for the number of parts 
of the set of Q-letters (instead of N + 2). 

Our goal is to cross-breed S with another S-machine in order to slow it down whilst preserving 
the width of computations. 

3.6 The adding S'-machine 

Let A be a finite set of letters. Let the set A\ be a copy of A. It will be convenient to denote A 
by Aq. For every letter ao £ A§ let a± denote its copy in A±. Consider the following auxiliary 
"adding" S-machine Z(A). 

Its set of state letters is Pi U P 2 U P 3 where P 1 = {L}, P 2 = {p(l),p(2),p(3)}, P 3 = {R}. The 
set of tape letters is Y\ U Y 2 where Y\ = Ao U A\ and Yi = Aq. 

The machine Z(A) has the following positive rules (there a is an arbitrary letter from A). 
The comments explain the meanings of these rules. 

• n (a) = [L ^ L, P {1) ^ a^p{l)a ,R ^ R}. 

Comment. The state letter p(l) moves left searching for a letter from Aq and replacing 
letters from A\ by their copies in Aq. 

• r 12 (a) = [L — > L,p(l) a^ 1 a 1 p(2),P i?]. 

Comment. When the first letter ao of Aq is found, it is replaced by a±, and p turns into 
p{2). 

• r 2 (a) = [L ^ L,p{2) ^ aQ P {2)a^\R ^ R]. 
Comment. The state letter p{2) moves toward R. 
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. r 2 i = [L^ L,p(2) -A p(l), ii - fl], ^(rai) = Fi, Y 2 (r 21 ) = 0. 
Comment. p(2) and i? meet, the cycle starts again. 

. n 3 = [L ^ L,p(l) - p(3), R - i?], Fi(n 3 ) = 0, F 2 (r 13 ) = A)- 

Comment. If p(l) never finds a letter from Ao, the cycle ends, p(l) turns into p(3); p and 
L must stay next to each other in order for this rule to be executable. 

. r 3 (a) = [L - L,p(3) - aop(3)a " 1 , i? - i?], Yi(r 3 (a)) = F 2 (r 3 (a)) = A 

Comment. The letter r 3 returns to i?. 
For every letter a e A we set rj(a _1 ) = rj(a) -1 (i = 1, 2, 3). 

The natural projection of every admissible word of Z(A) onto A takes a\ and ao to a, and 
all other letters to 1. The following lemma is obvious. 

Lemma 3.17. Let be a rule in Z(A), W = qiuiq2...u n q n +i with base qiq2---Qn+i be an ad- 
missible word for 9. Then 9 • W = q[wiUiW2q 2 w 2 U2--.w n+ iq' n+1 where Wj and Wj are empty if 
qj € {L,i?} ±:L and if qj = p£ {p(l),p(2),p(3)} ; then Wj = w{p) and Wj = w(p) are determined 
by the following formulas: 



w(p) 



(o ai) 



ifn = r 1 (a),p = p(l), 
ifn = r 2 (a),p = p(2), 
ifn = r 3 (a),p = p(3), 
ifn = r 12 {a) e ,e = ±l,p 
ifn e {rfg 1 ,^ 1 }, 



if (p) = < 



a 



-i 



ifn 
ifn 
ifn 
ifn 



p(l) i/e = l,p = p(2) i/e = -1, 



n(a),p =p(l), 
r 2 (a),p =p(2), 
r 3 (a),p =p(3), 



(3.7) 



(3.8) 



H2W 
e {rfg 1 ,^ 1 }. 



Finally if qj =p G {p(l) _1 ,p(2) _1 ,p(3) _1 } ; £/ien = -^(p -1 ) -1 , ?Dj = w(p -1 ) -1 . 

Lemma 3.17 immediately implies. 
Lemma 3.18. Suppose that an admissible word W has the form LupvR (resp. p~ 1 upvR) where 



u, v are words in (Aq U A-\ 



Lei 61 • W = Lu'p'v'R (resp. 9 W = (p'^u'p'v'R). Then the 



projections of uv and u'v' (resp. v 1 uv and (v') 1 u'v') onto A are freely equal. 

Remark 3.19. If we replace every letter in Ai by its index i, then every word u in the alphabet 
Aq U A\ turns into a binary number b(u). If the machine starts with the word Lup(l)R where 
u is a positive word in Aq, then b(u) = and each cycle of the machine amounts to adding 
1 to b(u). After 2l"l cycles the machine stops, the admissible word becomes Lup(3)R. Let us 
compute the length of this computation. 

Notice that if u = (&i)o---(&n)o f° r some b, L G A and at the beginning of a cycle of the 
computation the last k a-letters in the admissible word are from A\ and the a-letter number 
n — k is from Aq (k < n), then this cycle of the computation has the following history 



16 



r 1 (b n )...r 1 (b n - k+1 )r 1 2{b n - k )r2(b n - k+1 )...r 2 {b n )r2i 



(3.9) 



(the letter p(l) moves left searching for the first letter from Ao and replacing every a-letter from 
A\ by the corresponding letter from A§\ then (b n -k)o is replaced by {b n -k)\ andp(l) is replaced 
by p(2); then p(2) moves right, and after it meets with R, it is replaced by p{\) again). The 
length of the cycle is 2k + 2. The number of cycles of this length is 2 n ~ h ~ 1 . Therefore the total 
length of all of these cycles is Yl=l ( 2k + 2)2 n ~ fc " 1 . The length of the last cycle is 2n + 1. Hence 
the total length of the computation is 

n— 1 n— 1 , 1 

+ 2)2 n " fc - 1 + 2n + 1 = 2 n (J^ + 2n + 1< 6 • 2 n 

k=0 k=0 

for every n since ^2^=0 = ^ anc ^ 2 • 2 ra > 2n + 1 for every n > 0. Hence the length of the 
computation is between 2 n and 6-2™. 

Informally speaking, the remaining part of the section is devoted to describing all possible 
computations of the machine Z(A). In the next section, we shall use that information to describe 
computation of a composition of the S'-machine from [11] and Z(A). We first consider the case 
when the base is LpR. We show that if the machine Z{A) works without changing the length of 
an admissible word, then the computations are in some sense unique and are subcomputations of 
the computations described in Remark 3.19. Then we consider computations where the lengths 
of words can change. We show (and this is a standard feature of S*-machines used in [11] and 
other papers) that as soon as the length of an admissible word increases during the computation, 
it cannot decrease again later, that is there are no trapezia that look like a honey pot (width in 
the middle is bigger than the width of the bottom and the top). If the base is not normal, say, 
it is p _1 pR, we show that, again, there are no very wide "honey pots". 

Lemma 3.20. Suppose W is a word with base(W) = LpR. Then there are at most two rules 
of Z(A) that are applicable to W word without changing its length. 

Proof. Let W = LupvR. Assume that 8 ■ W = W' = Lu'p'v'R for a rule 6. If 8 = r±(a) for 
some a £ A ±l then the only ways equality \W\ = \W'\ can occur is when either v starts with 
<2q 1 or u ends with a\ (by Lemma 3.17). Thus there are at most two choices for a in that case. 
A similar statement holds for r2(a), r^(a). Moreover, since applicability of rj(a) determines the 
value of p = p(i), rules of only one of these three types can apply to W. For 6 = ri2(a), we have 
\W\ = \W'\ only if u ends with ao but not with a^ 1 aQ and p = p(l). Hence if ^12(0) does not 
change the length then only one other rule does not change the length (either r\(b) where b~ l 
is the first letter of v or r^i if v is empty). Similar arguments hold in all other cases. □ 

Lemma 3.21. For every admissible word W with base(W) = LpR, every rule 6 applicable to 
W , and every natural number t > 1, there is at most one computation W -^q W\ W% of 

length t where the lengths of the words are all the same. 

Proof. Let W -^e W± — ^ W2 — > ... —^e t -\ be a computation where all words have the 

same length. By Lemma 3.20, W2 is completely determined by W\ (because the history of the 
computation is a reduced word, and so 6\ / #~ 1 ), W3 is completely determined by W 2 , etc. □ 

For every word / and every i < \ f\ we denote the prefix of / of length i by f[i]. 



17 



Lemma 3.22. Let W = LvpuR, where p G {p(l),p(2)}, u is a word in A Q , and v is a word in 
(Aq U Ai)^ 1 . Suppose that a non-empty computation 

W = Wi f-W (3.10) 

is such that all words W s in the computation have the same length, f • W = Lv'p'uR, p' G 
{p(l),p(2)}, and either 

(1) p = p(2) and f[l] = r2(a) or 

(2) p = p(l) and f [l] = ri(a) -1 

where a is the first letter of u. Suppose also that f[j] • W does not have the form Lv"p"uR 
for every 1 < j < \ f\. Then u is a positive word, v ' = v and p' = p(l) in case (1) and p' = p(2) 
in case (2). Moreover, under the above conditions, the computation (3.10) is unique. 

Proof. We shall consider only the case when p = p(2). The other cases are similar. Let u = 
bib2...b n , bi G A^ 1 . Suppose that u is not positive, u = u[k]b~ 1 bk+2---b n where 6^+2, ■■■,b n G A 
(it could happen that bk+2---b n is empty, i.e. n = k + 1). 

Let f[l] = 9. Since all words in the computation have the same length, we can conclude, by 
Lemma 3.21, that for every j between 1 and |/| there exists exactly one computation of length 
j starting with W -^g 9 ■ W. The history of that computation must be f[j]. 

Using Remark 3.19 and (3.9) one can find a computation W -^g 9 ■ W — ► ... with history g 
of the form 

9 = ^2(&i)---r2(6r i )r2in2(&n)r2iri(& n )ri2(fe„_i)r2(6„_i)r 2 (6„)r2i...r2iri(6„)...ri(& fc +2), 

and g ■ W = Lvuib~ l p(l)bk+2---b n R- Therefore either g is a prefix of / or / is a prefix of g. 

Note that / cannot be a prefix of g because f ■ W = Lv'p'uR and there is no j such that 
g[j] ■ W has that form. Hence g is a prefix of /, i.e. g = f[j] for some j. Moreover j < \f\. 

Notice that since b G Aq, there is no rule except for ri(6fc +2 ) _1 (if n > k + 2) or r^ 1 (if 
n = k + 1) which can be applied to that word without increasing its length. This contradicts 
the assumption that / is reduced. Thus u is positive. 

Now we can consider the computation with the history of the form 



h = r2(6i)...r2(6 n )r2iri2(6„)r2iri(6 n )ri2(&„-i)r2(6 n _i)r 2 (6„)r2i...r2iri(6 n )...ri(6i) 

(see (3.9) again) such that h-W = Lvp{l)uR and words h[i] ■ W, i < \h\, do not have the form 
Lv"p"uR. Since no words f[i] ■ W have that form by assumption, we can conclude that / = h. 
This completes the proof. □ 

Now we shall find out what happens when the lengths of the words change during a compu- 
tation. 

Lemma 3.23. Let W , be an admissible word with b&se(W) = LpR. Let 9 G {ri(a) ±1 , r2(a) ±1 , 
r3(a) ±1 }. Suppose that \9 ■ W\ > \W\. Then for every computation W -^g W\ —>$/ W2 we have 
IW2I > I Wi I . Moreover, if 9' is not of the form ri2(a) =tl , then the p-letters in W2 and W\ are 
the same and W2 does not have a 2-letter subword of the form pR. 

Proof. Suppose that 9 = r\{a) (the other cases are similar). If |r • W\ > \W\, then r • W = 
Lua^ 1 p(l)a\vR where the right hand side is a reduced word. It is easy to see that the only rule 
that can apply to r ■ W without increasing the length is r _1 , and the only type of rules that can 
apply are n (b) and rn (b) . This immediately implies both statements of the lemma. □ 
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Lemma 3.24. Let W = LvpuR and base(VF) = LpR. Suppose that \0 ■ W\ > \W\. Then for 
every computation W — W\ — ► W2 —>...—>/■ W, we Ziaue |Wj| > \W\ for every i > 1. 

Proof. By contradiction, suppose that there exists a computation W — >g Wi —>...—►/• W such 
that |/ ■ W\ < |W|. Consider such a computation with the smallest |/| = t and smallest |it| for 
all such computations of length t. Then \W\ < \Wi\ = ... = |Wt_i| > \Wt\. By Lemma 3.23, 
9 = ri2(a) ±1 for some a. 

We have 9 ■ W = Lv{a$ l ai) e p(m)uR for some e £ {—1, 1}, m = (e + 3)/2, and some words 
u, v where all letters in u belong to A^ 1 . We shall assume that e = 1 (the other case is similar). 
Som = 2. 

Suppose that the letter a\ inserted by f[l] is not touched during the computation W\ — ► 
... — > Wt_i (i.e. it is not cancelled with a letter inserted by one of the rules of this computation). 
Since the letters of u do not disappear during the computation W\ Wf_ 1 (they may only 

change indices from to 1 by Lemma 3.18), the word Wt-i has the form Lva^ 1 a\U\pu2R where 
U2 is a suffix of u and u\ is obtained from the corresponding prefix of u by changing indices of the 
letters. Applying Lemma 3.23 to the inverse computation Wt — > Wt-i — > ... — > W we conclude 
that the last rule <?' in / is ri2(fc) ±:L for some 6. Hence Wt = Lv'p'u2R- Since \Wt\ < \Wt-i\ and 
since / is a minimal counterexample, we conclude that ui must be equal to u. So u\ must be 
empty (otherwise the computation Wt — ► Wt_i W would be a smaller counterexample 

since it has the same length as W\ — > ... — > Wt, but starts with Wt = Lv'p'u2R with j^l < M), 
and either p = p(2), and the last rule of / is ri2(a) _1 or p = p(l), the first letter of u is Oq 1 , 
and the last rule of / is ri(a) -1 . By Lemma 3.22, both cases are impossible. 

Therefore we can assume that a\ is touched by the computation. Hence, by Lemma 3.22 
applied to the computation W\ — > ... — > Wt-i, u is a positive word, and if s is the number of 
the rule that touches a±, we have s < t — 1, /[s] • = LvaQ 1 p(l)aouR and the rule number s 
in / is n(a). Since it is positive, the word a^u is reduced. But then every rule of Z(A) except 
ri(a) -1 would increase the length of the word, a contradiction. This completes the proof of the 
lemma. □ 

Lemma 3.25. Let h&se(W) = LpR. Then for every computation W = Wq — ► W\ — > ■ ■ • — ► 
W t = f ■ W of the S -machine Z(A): 

1. \Wi\ <max(|W |,|/-W|) ; i = 0,...,t, 

2. If W = LupR where p = p(l) (resp. p = p(3)), f ■ W contains p(3)R (resp. p(l)R) and 
all a-letters inW,f-W are from A^ 1 , then the length of f is between 2l u l and 6 • 2\ u \, u is 
a positive word, and all words in the computation have the same length. 

Proof. 1. Immediately follows from from Lemma 3.24. 

2. We consider the case when W = Lup(l)R, the other case is similar. If u is empty, the 
statement is obvious. So assume that u is not empty. By Lemmas 3.24 and 3.18, all words in 
the computation have the same length. 

The letter p(3) can occur only after rule ri3 is executed. The admissible word f[i] ■ W to 
which ri3 is applied must have the form Lp(l)uR. Indeed, the a-letters in f[i] • W must be 
from A (by the definition of r±s), and the projection of W and f[i] ■ W onto A must coincide 
by Lemma 3.18. The word f[i — 1] ■ W in the computation must be La\p{l)u' R where ao is 
the first letter of u, a$u' = u. Hence during the computation the first letter of u must change 
the index from to 1. Hence for some 12 < h, we have /[^J ■ W = Laop(2)u' R and u[l] is a 
positive letter. Applying now Lemma 3.22 to the smallest initial part of the subcomputation 
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f[i 2 ] ■ W, f[i — 1] • W satisfying the conditions of this lemma, we conclude that u is a positive 
word. 

The first letter in / is either r 12(a) or r 2 f. Suppose first that f[l] = r 2 \. 

By Remark 3.19 there exists a computation connecting Lup(l)R and Lup{3)R. It has the 
history g = r 12 (b 1 )r 21 r 1 (b 1 )r 12 (b 2 )r 2 (b 1 )...r 13 ...r 3 (b 2 )r 3 (b 1 ) where u = b s ...b 2 b 1 , \ G A . Hence, 
by Lemma 3.21, either g is a prefix of / or / is a prefix of g. But note that there is only one 
rule applicable to the word Lup{3)R that does not increase its length (namely r 3 (bi)~ 1 ). Hence 
f = g and so \ f\ is between 2l u l and 6 • 2l u L 

Now let f[l] = r 2 f. Then, as in the previous paragraph, we deduce that / must start with 
f[s + 1] = r 2 ~fr 2 (bi 1 )r 2 (b 2 1 )...r 2 (b~ 1 ). But then f[s + 1] • W = Lp(2)uR and there is no rule 
except r 2 (b s ) that can be applied to this word without increasing the length. The s + 2-nd rule 
in / cannot be r 2 (b s ) since / is reduced. Thus this case is impossible. □ 

Lemma 3.26. Let W = Wq = (p(°)) -1 ifo2^ be an admissible word with base(W) = p~ 1 p (p^> G 
{p(l),p(2),p(3)}. Let W -+ 6l Wi ^e 2 ••• ->o t w t be a computation. Let Wj = (p^) -1 «jP^, 
Uj = w^Uj-iWj where Wj is defined in Lemma 3.17, j = 1,2,.... Suppose that none of p^ is 
equal to p(3) . Then: 

1. The word Q\Q 2 ... is a subword of the word of the form r\...r\X\r 2 ...r 2 y\r\...r\x 2 ... where ri 
stands for any ri(a), Xj G {ri2(a),r 21 1 | a € A}, yj € {^12 (a)" 1 , r 2 \ \ a £ A}. 

2. If none of the rules 6\,6 2 , ... is r 2 l, then none of the words w±, w 2 , wt is empty, and the 
product w\...wt is a freely reduced word. 

3. The word 9\...6 t is completely determined by the rule 6\ and the word w\w 2 ...w t . 

Proof. 1. The first statement is obvious. 

2. Statement 1 and Lemma 3.17 imply that the sequence w\,w 2 , ...,wt is a subsequence of 
the sequence 

ai(l), ...,ai(*i), (a (h + l)- 1 ^! + l)) ei ,a (ti + 2), ...,a (t 2 ), 
(a (t 2 + l)- 1 ^^ + l))\ai(t 2 + 2), ... 

where aj(j) G Af 1 , €j G {0,1}, 8j G {0,-1}, and we set -u = 1 for every word v. Moreover if 
rfi does not appear in the computation, none of the 6j and ej are equal to 0. Hence w\w 2 ...wt 
is freely reduced. 

3. The third statement follows from the form of the sequence (3.11). Indeed, if 9\ = r±(a), 
then wi = a± G Af 1 , and 9 2 is completely determined by the next one or two letters of the 
word w\w 2 ...wt. if the second letter is 61, then 2 = r\(b); if it is 6q 1 and the third letter is 
b±, then 9 2 = r± 2 (b); if it is bo and the third letter either does not exist or has index 0, then 
02 = f2i- Similarly for other choices of 6\, the second and the third letter of wiw 2 ...w 2 completely 
determine 6 2 . Now we can complete the proof by induction on i. □ 

Lemma 3.27. Suppose base(H / ) G {LpR,p~ 1 pR}, both W and f • W contain p(l)R (resp. 
p(3)R) and all a-letters in W , f • W are from Aq. Then f is empty. 

Proof. The statement can be proved in the same way as part 2 of Lemma 3.25 provided 
base(W) = LpR. 

Let base(H^) = p~ x pR, W = p(l)~ l up(l)R = f ■ W where all letters in u are from Af 1 
(the case of p(3) is similar). By Lemma 3.26, part 1, u = w~~ l uw where w = w\w 2 ... where 
Wj's are determined by the formulas from Lemma 3.17, the product w\w 2 ... considered as a 
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word in the alphabet (Ao U ^4i) U {1} (i.e. we do not throw away the empty factors) does 
not contain subwords xx^ 1 , xlx^ 1 or 11. By Lemma 3.17, ...W2W1 = 1. Then, by Lemma 3.17, 
since all letters of u are from Aq, all rules in / are from {r 2 (a), r 3 (a), r^ 1 , -rj^ 1 }. But in that 
case Wj = wj 1 by Lemma 3.17. Therefore the product ...W2W1 is reduced. Since that product 
is 1, wi, W2, w±, W2, ... are empty words. Since the factorization w\W2--- does not contain 11, 
we conclude that either / is empty or / = r^ 1 (by Lemma 3.17). The second option is clearly 
impossible. □ 

We shall need the following general statement from [8]. 

Lemma 3.28 ([8], Lemma 8.1). For arbitrary elements u,v,w of F and any integer t > 0, 
the length of an arbitrary product u 3 wv 3 in F is not greater than 2(\u\ + \v\ + \w\) + \u t wv t \ 
provided < j < t. 

Lemma 3.29. Suppose that one of the following conditions for an admissible word W of Z(A) 
is satisfied (there p = {p(l) , p(2) , p(3)} ) : 

1. W does not contain a p-letter. 

2. base(VF) = Lpp^ 1 ; 

3. base(VF) = pp~ l p; 

4. hase(W) = p~ l pR; 

5. hase(W) = LpR. 

Then the width of any computation 

W = W -^e Wi ... W t 
is at most Cmax(|W|, \Wt\) for some constant C. 

Proof. 1 is obvious: the length of the admissible word does not change during the computation. 

2. Let base(W) = Lpp^ 1 . Let W = Wo —>e W\ —>-g 1 ... —*e t -i Wt be a computation. Then 
Wi = LuiqiViq' 1 , where % G {p(l) , p(2) , p(3)} . It is easy to see that for each i = 0,...,t — 1, 
Uj+i = UiWi, Vi + \ = w~ 1 ViWi (equalities are in the free group) where Wi and Wi are determined 
by Lemma 3.17 (see (3.7), (3.8)). 

Therefore vdi is a letter from A^ 1 or 1, and the projection of Wi onto Ao is freely equal to 
Wi. In particular, \wi\ < \wi\. 

Since there is no R between p and p~ l , none of T{ is T2i- 

If 8{ = ri3, then n« = 0, and WQ...Wi-\ = Uq 1 . Therefore \v{\ <2|no| + |^o| < 3max(|uo|, \vq\)- 
Hence it is enough to assume that rules rf% do not occur during the computation and prove 

that in that case, say, \Wi\ < 7max(|Wo|, \ Wt\). 

We can assume that all rules 9i are in {ri(a), r2(a), ri2(a) ±1 } (the case when all 9i are of the 

form r^ is similar but easier). Then the history of the computation is a subword of a word of 

the following form (we write rj, r±2 instead of ri{a) and r±2(a)): 

ri...r 1 ri 2 r 2 ...r 2 r 1 2r 1 ...r 1 ... (3.12) 

Since the history is reduced, by Lemma 3.26, woW\...Wi, is a freely reduced word for every i. 
In addition \wow>i...Wi\ < \woWi...Wi\. Hence Si + |«o| + ko| + 3 > \Wi\ > i — \uo\ for every i. 
Therefore \Wi\ < 3i+\W \ < 3t+\W \ < |W |+3|W t |+3|«o| < 4|W |+3|Wt| < 7max(|W |, \W t \). 
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3. Let base(W) = pp 1 p. Let W = Wq —>$ W\ — ^ ... —>o t -i Wt be a computation, 
Wi = qiUiq^Viqi, where q { £ (p(l),p(2),p(3)}. 

Let Wj for some j be a longest word in the computation. Then we can assume without loss 
of generality that \Wq\ is shorter than any of the words W\, ...,Wj, and Wt is shorter than any 
of the words among Wj,...,W t . So assume that \W t \ < \Wj\ > \Wq\. 

Since the base of W does not contain L between p~ l and p or R between p and p -1 , rules 
r 2ii r t£ are excluded, so Tj G {ri(a), r 2 (a), r 3 (a), T 12 (a) ±1 }. As in part 2, we shall assume that 
r$ does not appear in the computation. 

By Lemma 3.17, u i+ i = WiUiW~ l , Vi + \ = w~ 1 ViWi where Wi = w{qi) and Wi = w{qi) are 
determined by formulas (3.7), (3.8). 

As in part 2, the history h = 0Q...9 t -i of the computation is a subword of the word of the 
form (3.12), the product woWiW2--.wt-i is a freely reduced word, and \u>j\ < \wj\ for every j. 

Therefore if |fj+i| > \vi\ for some i, then \vi\ < ... < \vt\ and Wt cannot be shorter than Wt—i'. 
indeed |iit— 1| — \ut\ < 2 by (3.8) from Lemma 3.17 and \vt\ — \vt—i\ > 2 since vt is a conjugate of 
vt-i- Similarly if < then Wo cannot be shorter than W±. Hence \vo\ = \v±\ = ... = \vt\, 

i.e. all Vi are cyclic shifts of v. This implies that the word WQ...wt-\ is periodic with period 
d < \v\. By Lemma 3.26, the history h of the computation is determined by its first letter 9\ 
and the word w = w\W2---- Every letter in w is contained in one of the Wi, so it corresponds 
to one of d{. Consider the letters in w number 1, d + 1, 2d + 1, .... These letters are the same 
since the word w is periodic with period d. Let D be the total number of rules in Z(A). Then 
among the first D + l rules corresponding to these letters, there are two equal rules. Since none 
of the words Wi contain two same letters, we can deduce that the word h is periodic with period 
di < {D + l)d. 

Therefore, by Lemma 3.17, the sequence wq, wt-i is periodic with the same period di. Let 
z = w^-i-.-Wq. Then Uj = ZjZ s uoz~ s zj 1 , ut = ztz s uqz~ s z^ 1 for some words zj,z t of length at 
most d\ and some s, s'. Now we can apply Lemma 3.28 and deduce that \uj\ < 2(2di + \uq\) + 
\ut\ + 4cZi < Ci(|uo| + |uo| + \u t \) for some constant C\. Therefore \Wj\ < C(\W \ + \W t \) for 
some constant C as required. 

4. We can assume that If is a shortest word in the computation. Using notation similar to 
Case 3, Wi = p~ l VipuiR and Uj+i = w^ 1 ViWi, Uj+i = WiUi. As in Case 3, we can assume that 
l^o I = |^i | = ... which implies, as before, that the history of the computation is periodic with 
period d\ < (D + l)\v\. The proof can be completed as in Case 3, using Lemma 3.28. 

Case 5 follows from Lemma 3.25. □ 

3.7 A composition of S and the adding machine 

Now let us define a "composition" S o Z of S and Z(A). Essentially we insert a p-letter between 
any two consecutive ^-letters in admissible words of S, and treat any subword ki...p...ki + \ as an 
admissible word for Z(A). 

First, for every i = 0, TV — 1, we make two copies of the alphabet Yi of S (i = 1, iV — 1): 
Yifi = Yi and Y^i. The set of state letters of the new machine is 

Ki U Pi U K 2 U P 2 U ... U Pjv_i U K N 

where Pi = {pi,Pi(6, l),pi(6,2),pi(6,3) \ 6 G 0}, % = 1, ...,N — 1. We shall denote the compo- 
nents of this union by Qi, Q 2 N-i- 
The set of state letters is 

y = (Y lfi u y 1a ) u Y lt0 u (y 2)0 u y 2) i) u y 2)0 u ... u (Yv-1,0 u Y N _ hl ) u yjv_i )0 ; 
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the components of this union will be denoted by Y\, ...,Y 2 n-2- 

The set of positive rules of S o Z is a union of the set of suitably modified positive rules 
of <S and 2(iV — 1)|0| copies Zi(9) + {9 G @,i = 1, ■ ■■,N) of positive rules of the machine Z(Yi) 
(also suitably modified). 

More precisely, every positive rule 9 G + of the form 

[k\U\ — > k' 1 U 1 ,Vik 2 U2 — > v' x k' 2 U 2 , ...,VN-lkN — ► fjV-lkjv] 

where fcj, A;^ G .fQ, Uj and i>j are words in 1", is replaced by 

x [fciui -> /c^n^fipi -> v' 1 pi(9,l),k 2 u 2 -> k' 2 u' 2 ,..., 
v = ^ 

■U7V_iPat_i -»■ ^_ 1 pat-i(6', 1), % -»■ fe^] 

with r 2 i-i(0) = ^i,o(<9) and Y 2 i = for every i. 

Thus each modified rule from turns on N — 1 copies of the machine Z(-A) (for different 
A's). 

Each machine is a copy of the machine Z(Yi) where every rule r = [C/i — ► V\,U 2 — > 

V 2 ,U3 — > V3] is replaced by the rule of the form 



1) p a (0, 1), s = i + 1, N - 1 



where J7i, U 2 , U3, V\, V 2 , V3 are obtained from Ui, U 2 , U3, Vi, V 2 , V3, respectively, by replacing p(j) 
with Pi(9,j), L with fc^ and with k' i+1 , and for s ^ i, Y 2s _i(fi{9)) = Y^. 

In addition, we need the following transition rule Q{9) that returns all p-letters to their 
original form. 

% - k' i ,p j {9,Z)^p j ,i = l,...,N,j = 1,...,N-1]. 

Thus while the machine works all other machines Zj(.), j / i, must stay idle (their 

state letters do not change and do not move away from the corresponding fc-letters). After the 
machine Zi(9) finishes (i.e. the state letter pi(9,3) appears next to fcj+i), the next machine 
Zi + \(9) starts working. After all p-letters have the form pj(9,3) we can apply £(#) and turn 
all pj (r, 3) into pj . Thus in order to simulate a computation of S consisting of a sequence 
of applications of rules 9\,9 2 , ...,9 S , we first apply all rules corresponding to 9±, then all rules 
corresponding to 9 2 , then all rules corresponding to #3, etc. 

The modified rules 9 of S will be called basic rules. We shall call basic rules, transition rules 
C(9), and their inverses main rules. 



3.8 Properties of the S'-machine S o Z 

Notation. For every word W, we shall denote the number of ci-letters in W by |T^| a . 

Every word in the alphabet {Qi, ■ ■■,Q 2 N-i} is called a base word. Let B be a finite set of 
base words. 

Definition 3.30. We call a base word B-covered if 

• it is covered by bases from B (i.e. every letter belongs to a subword from B), 
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• it starts and ends with the same g-letter x. 

We call a base word w B-tight if it has the form uxvx where xvx is a ^-covered word, w does 
not contain any other ^-covered subwords. A base word is called B-narrow if it does not contain 
^-covered subwords. 

Lemma 3.31. There exists a finite set of base words B such that 

(*) the length of every B-narrow base is smaller than a constant Kq, 

(**) for every admissible word W with base from B the width of every computation W — ► W\ — > 
... — ► Wt does not exceed C(\W\ + \ Wt\ + log 2 i) for some constant C. 

Proof. Let B be the set of bases of the form (kiqiki + \) ±l , (g~ 1 %A;i + i) ±1 , (A^g" 1 ) 1 * 11 , hk~ x , 
K 1 ^, (QiQT 1( li) ±1 (<li = {Pi(QA),Pi(Q,2),Pi(Q,3),Pi})- Lemma 3.9 implies that condition (*) 
holds for B. Indeed, base words of the form {qiq^ 1 qi) ±1 and all base words that start and end on 
the same fc-letters and do not contain (qiq~ qi)^ 1 as a subword are ^-covered, and every word 
of length at least 6./V contains one of these ^-covered words. 

Let W be an admissible word with one of the bases from B. By passing to W^ 1 if necessary, 
we can assume that base(W) has one of the forms fciftfcj+i, q^tftfcj+i, kiqiq^ 1 , kikj 1 , k~ x ki, 

Let / = /1/2/3 where /1, f% contain no basic and transition rules or their inverses, / 2 starts 
and ends with a basic or transition rule or its inverse (one or more of these subwords may be 
empty) . 

Case 1. Suppose that / = /1, so / does not contain main rules. 

Notice that we can represent / as a product z\z?....z s where each z\ is a history of computation 
of one of the machines Zj^(6) for some and j(i) depending on i, j(i) / j(i + 1). Since f ±l 
does not contain basic and transition rules, all #'s are the same. By Lemma 3.25, 3.29, if 
base(VF) {kipki + i,p~ 1 pki + i} 1 then s = 1, the width of the computation does not exceed 
Cmax(W, / • W) for some constant C and no basic or transition rule can apply to / • W . 

If W = kiupvki + \ or W = p~ 1 upvki+\, then each subcomputation corresponding to Zj either 
does not affect the admissible word, or it is essentially a computation of Z(Yi) (with L replaced 
by hi, R replaced by fcj+i, etc.). In the first case the width is equal to the length of W. In 
the second case, the width of the subcomputation corresponding to Zj does not exceed C times 
the sum of lengths of the beginning and ending words of the computation (by Lemma 3.29). 
Moreover in the second case the end word of the subcomputation, except possibly for / • W, has 
the form kiwp'ki+i (resp. (p')~ 1 wp'ki + i) for some word w in Y^. By Lemma 3.18, the words 
w and uv (or w and v^uv ) must have the same projections onto Y\. Therefore all w/'s have the 
same lengths which does not exceed max(|W|, \ f ■ W\) (resp. 2max(|II / |, |/- W|)). Therefore the 
width of the whole computation does not exceed C(|W| + \f ■ W\). Moreover we can conclude 
that if / ■ W contains a 2-letter subword pR, then \f ■ W\ < \W\. 

Case 2. Suppose that fi is of length 1, i.e. it contains exactly one main rule. Then the base 
of W is kiqiki + \ or q^ 1 qiki + i. We can repeat the argument from the last paragraph of Case 1, 
applied to the subcomputations corresponding to f± and /3" 1 . Note that a main rule f^ 1 must 
be applicable to fi ■ W and to f^ 1 ■ (f ■ W)), hence these words must contain subwords of the 
form pki + \ and all a- letters must belong to Yip. Hence the lengths of the end words of these 
subcomputations cannot be bigger than 2\W\ or 2\f ■ W\ by Lemma 3.18. 

Case 3. Suppose that / 2 contains at least two main rules. 

Again base(iy) must belong to {kiqiki+i, q^qiki+i} for some i. By the argument in Case 1, 
|/i • W\ < 2\W\, I/1/2 • W\ < 2\f • W\, so we can assume that f\ and f% are empty, thus / starts 
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and ends with a main rule. We can assume that this computation has minimal length among 
all computations connecting W and / • W and having the same width. 

Let rgr' be a subword of / where r, r' are main rules and g does not contain main rules. 
Then either g is empty or g = g\...g m and each gj is a non-empty product of rules of one of the 
machines Z s .(0j), Sj / Sj+i- 

Suppose that g is not empty and Sj = i for some j. In this case we shall call the subword 
rgr' active. Let / = fgtf" for some /',/". Let W = f ■ W, W" = 9j ■ W. Then both W 
and W" must contain subwords ph + \ where p € {Pi(-, l),Pi(., 3)} and all a-letters of W',W" 
are in Y^q. Since gj is not empty, by Lemma 3.27, either W' contains pi(., 1) and W" contains 
Pi(.,3), or W' contains Pi(.,S) and W" contains pi(., 1). Since rule r^ 1 cannot be applicable to 
admissible words of Z(A) of the form p~ 1 pR, we conclude that base(VF) = kiqiki+i. By Lemma 
3.25, the length of gj is at least 2^ w I -3 (here \W'\ — 3 is the number of a-letters in W), and the 
lengths of the words in the subcomputation W' W" are the same. 

If none of Sj are equal to i, then we call the subword rgr' passive. In that case all rules of g fix 
admissible words with base base(W). Therefore all words in the subcomputation corresponding 
to g are the same. From the definition of main rules, then either r' = r _1 , or r is the inverse 
of a basic rule and r' is another basic rule, or r is a transition rule and r' is the inverse of 
another transition rule. If r' 7^ r _1 , g is non-empty, and the subcomputation corresponding to 
g does not contain the longest word in the computation, we can use Lemma 3.12 and remove 
the subcomputation corresponding to g and obtain a shorter computation of the same width 
connecting W and / • W. That would contradict the assumption that / is the shortest possible. 
Hence we can assume that if rgr' is passive and g is not empty, and the subcomputation 
corresponding to g does not contain the longest word in the computation, then r' = r _1 . 

Notice also that we have proved, that in our computation, \Wj\ = |Wj+i| unless the j + 1-st 
rule of the computation is a main rule. 

Let J = \f[t] ■ W\ be the width of the computation W —>...—►/• W. We can assume (by 
the remark made in the previous paragraph) that the t-th rule in / is a main rule. For every 
admissible word U with base(U) = base(W) and every basic rule r the difference \U\ — \t ■ U\ is 
bounded from above by a constant c. We can assume that J — \W\ > 4c. Let to be such that 
J — \f[to] -W\ > 4c and t — to is minimal possible. Then the suffix of f[t] obtained by removing 
/[to] has the form / = T\g\T2g2---TjgjTj + i such that 

• 3 > 3, 

• all 77 are main rules, all g\ do not contain main rules, 

• all admissible words in the subcomputation corresponding to / are of length between J— 4c 
and J, 

• one of the admissible words in that subcomputation has length at most J — 4c. 

If one of the subwords TigiTi + \ in / is active then the length of / is at least 2 J ~ Ac ~ 3 . So 
J < log 2 (/j + 4c + 3 and (**) holds. 

Thus we can assume that all subwords t^/Tj+i are passive. But then, as we proved before, 
for every I either gi is empty or 77+1 = t^ 1 . Moreover if gi is empty then 77+1 cannot be 
the inverse of a basic rule or a transition rule. Hence gi + \ cannot be empty. Therefore the 
lengths of admissible words in the subcomputation corresponding to / can differ by at most 2c, 
a contradiction with the fact that one of these words has length J and the length of another one 
is at most J — 4c. □ 
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From now on we shall fix the set B of base words from Lemma 3.31, and a constant K such 
that 

K > 2K . (3.13) 



4 Properties of the group S o Z 

Let 0+ be the set of positive rules of S o Z. 

We redefine the map a and the stop word W (see Section 3.5) as follows: 

<7» = fcl(l)vp(l)fc2(l)p(2)fc 3 (l)...p(JV - 1)M1) 

for every word v over X, 

W' = fci(0)p(l)fc 2 (0)p(2)A;3(0)...p(JV - 1)^(0). 
Recall that £ is the language recognized by 5. 

Lemma 4.1. Lei Wi = cr'(v) and W\ — ^ ... — >8 t -i W* be any computation of S o Z . Let 
6i 1 ,...,6i s be all basic rules or their inverses in that computation. Let 9^ be the rule of S 
corresponding to 0j . . For each j let Wj be the (natural) projection of Wj on the alphabet of q- 
and a-letters of S. Then there exists a computation W\ — > ... — > Wi o/ i/ie machine S, whose 
history is the reduced form of the word 9±...9 S . 

Proof. This immediately follows from Lemmas 3.18, 3.12 and the definition of S o Z. □ 

Lemma 4.2. A word v over X belongs to C if and only if o~'(v) and W' are conjugate in S o Z . 

Proof. Suppose that o~'{v) is a conjugate of W' . Then, by Lemma 3.13, there exists a com- 
putation a'(v) — > ... — > W"' of 5 o Z. Then, by Lemma 4.1, there exists a computation 
a'(v) — > ... — > W"'. Note that cr'(w) = cr(v), W"' = W". Hence f is accepted by S. Hence 
v e C. 

Conversely suppose that v £ C. Then, by Lemma 3.16, there exists a computation of 

5 connecting a(v) with W and consisting of positive words. Then, by Lemma 3.22 (applied 
several times), there exists a computation of SoZ connecting cr'(v) and W, so by Remark 3.14, 
a'(v) and W' are conjugate in S o Z. □ 

Let L be the maximal length of a defining relation of S o Z. 

Lemma 4.3. Let T be a 9-band with base of length l^. Let l a be the number of a- edges in the 
top path top(T). Then the length ofT is between l a — (L — and l a + (L + 1)1^. 

Proof. Every a-letter in the label of top(T) labels an edge of a (a, 0)-cell or on a (q, 0)-cell. 
Every (q, 9)-cel\ in T has at most L a-edges lying on top(T), and every (a, 0)-cell has at most 
one a-edge lying on top(T). At most L of these a-edges can belong to the same (g, 0)-cell. 
Hence \T\ > l a — Ll^ + 

The number of (a, 9)-ce\ls having no common a-edges with top(T), does not exceed Ll^ 
because at least one of the two a-edges of this cell is glued to an a-edge of a (g, 0)-cell. This 
proves the inequality \T\ < l a + Ll^ + ij. □ 

Lemma 4.4. Let IS. be a trapezium of height h > 1 whose base is B-tight and B-covered. Then 
the area of A does not exceed Ch(\W\ a + \ W\ a + log/i + 1), where W,W are the labels of its 
top and bottom, respectively for some constant C. 
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Proof. Notice that the number of letters in the base of A does not exceed Kq + 1. Since base(W) 
is covered by bases from B, A is covered by subtrapezia whose width, by Lemmas 3.31 and 4.3, 
does not exceed a constant times \W\ a + \ W'\ a + log/i + 1, height does not exceed h, and the 
number of these subtrapezia does not exceed a constant (since B is finite and from every cover of 
base(Ty) by words from B, we can find a subcover which covers every letter of base(VF) at most 
a constant number of times). Hence the width of A does not exceed C(| W| a + | W'\ a + log h+ 1) 
for some constant C. Since, by Lemma 4.3, the area of A does not exceed a constant times the 
hight times the width, the statement of the lemma follows. □ 

4.1 A modified length function on S o Z 

Let us modify the length function on the group S o Z. The standard length of a word (path) 
will be called its combinatorial length. From now on we use the word length for the modified 
length. As before let L be the maximal length of a defining relation of S o Z. We set the length 
of every q- letter equal 1, and the length of every a-letter equal a small enough number S so that 

2 - \U - LK5 > 5. (4.14) 

We also set to 1 the length of every word of length < L which contains exactly one ^-letter 
and no g-letters (such words are called (6, a)- syllables). The length of a decomposition of an 
arbitrary word in a product of letters and (0, a)-syllables is the sum of the lengths of the factors. 
The length of a word w is the smallest length of such decompositions. The length of a path in 
a diagram is the length of its label. The perimeter \dA\ of a van Kampen diagram is similarly 
defined by cyclic decompositions of the boundary dA. It follows from this definition that for 
any product s = S1S2 of two paths in a van Kampen diagram, we have 

\si\ + | «2 1 > \s\ > \si\ + \s 2 \ - L5 (4.15) 

A maximal #-band of a van Kampen diagram A is called a rim band if its top or its bottom 
side lies on the contour d A. 



Lemma 4.5. Let A be a van Kampen diagram whose rim band T has base with at most K 
letters. Denote by A' the subdiagram A\T. Then \dA\ — \dA'\ > 5. 

Proof. Let s be the top side of T and s C d A. Note that by our assumptions the difference 
between the number of a-edges in the bottom s' of T and the number of a-edges for s cannot 
be greater than LK. Hence \s'\ — \s\ < LK5. However, A' is obtained by cutting off T along s', 
and its boundary contains two #-edges fewer than A. Hence we have |so| — l s ol — 2 — 2LS by 
(4.15), for the complements so and s' Q of s and s', respectively, in the boundaries dA and dA'. 
Finally, 

|<9A| - |0A'| > 2 - 2L5 - LK5 - 2L5 > 5 
by (4.14) and (4.15). □ 

The definition of length has also the following obvious consequence. 

Lemma 4.6. Let s be a path in a diagram A having c 9-edges and d a-edges. Then 

(a) \s\ > max(c, c + (d — Lc)S); 

(b) \s\ = c if s is a top or a bottom of a q-band. 
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4.2 Combs 

Definition 4.7. We say that a reduced diagram T is a comb if it has a maximal (/-band Q (the 
handle of the comb), such that 

(Ci) bot(Q) is a part of dT, and every maximal 6*-band of T ends at a cell in Q. 

If in addition the following properties hold: 

(C2) one of the maximal #-bands T in T has a 0-tight base and 

(C3) other maximal #-bands in T have ,6-tight or i3-narrow bases 

then the comb is called tight. 

The number of cells in the handle Q is the length of the comb, and the maximal length of 
the bases of the 6>-bands of a comb is called the basic width of the comb. 




Figure 1: Comb. 

Notice that every trapezium is a comb. 

Definition 4.8. Let Q be a maximal g-band of A. Then bot(Q) divides A into two parts. The 
part containing Q is called the top part of A with respect to Q. The other part is called the 
bottom part of A with respect to Q. 

Lemma 4.9. Let IS. be a reduced diagram with non-zero area. Assume that every rim band of 
A has base of length at least K . Then there exists a maximal q-band Q in A such that the top 
part r of A with respect to Q is a tight comb. 

Proof. Let Tq be a rim band of A. Its base w is of length at least K, and therefore w has disjoint 
prefix and suffix of lengths Kq since K > 2Kq by (3.13). The prefix of this base word must have 
its own B-tight subprefix w±, by Lemma 3.31, part 1, and the definition of ,6-tight words. A 
g-edge of % corresponding to the last (/-letter of the w\ is the start edge of a maximal g-band Q! 
which bounds a subdiagram V' containing a band T (a subband of %) satisfying property (C2). 
It is useful to note that a minimal suffix W2 of w, such that w^ 1 is tight, allows us to construct 
another band Q" and a subdiagram T" which satisfies (C2) and has no cells in common with T'. 

Thus, there are Q and T satisfying (C2). Let us choose such a pair with minimal area(r). 
Assume that there is a #-band in F which does not cross Q. Then there must exist a rim band T\ 
which does not cross Q in T. Hence one can apply the construction from the previous paragraph 
to l~i and construct two bands Q\ and Q2 and two disjoint subdiagrams Ti and T2 satisfying 
the requirement (C2) for T. Since Fi and T2 are disjoint, one of them, say Ti, is inside T. But 
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Qi Q 2 Q 



Figure 2: Lemma 4.9. 

the area of Ti is smaller than the area of T, and we come to a contradiction. Hence T is a comb 
and condition (Ci) is satisfied. 

Assume that the base of a maximal #-band T of T is not narrow. Then it has a /3-tight 
proper prefix (we may assume that T terminates on Q), and again one obtain a g-band Q' in T, 
which provides us with a smaller subdiagram V of A, satisfying (C2), a contradiction. Hence T 
satisfies property (C3) as well. □ 

Lemma 4.10. Let I and b be the length and the basic width of a comb T and let Ty,...Ti 
be consecutive 9-bands of T (as in Figure 1). We can assume that bot(7i) and top(7/) are 
contained in dT. Denote by a = \dT\ a the number of a- edges in the boundary ofT, and by a\ 
the number of a-edges on bot(Ti). Then a + AClb > 2u\, and the area of V does not exceed 
2Cbl 2 + 2al for some constant C . 

Proof. For every i, top(7i) and bot(7i + i) can have an initial segment in common. Let ai and ol i 
be the numbers of the a-edges in bot(Ti) and top(7i) respectively that belong to the boundary 
of r. 

Let rii be the length of 7^. It follows from Lemma 4.3 that |nj+i — ni\ < 2Cb + ci i + CKj+i. 
Since n\ < Cb + a\ by the same lemma, we have for any i, 

rii < (Cb + a) + 2C(i - 1)6 + ct x + a 2 H h a-_ x + a { < 2Clb + 2a 

This inequality provides us with a required upper bound for the area Yl\=i n «- Finally) 

a - oti > Y!i^\ a i + Y!i=2 a i + a 'i> Etl \ n i+i - n i\ - 2Cb(l - 1) + (ni - Cb) 
> EtiC"* - n »+i) - 2Cb ( l - 1) + (ni - Cb) > m - 3Cbl >a ± - ACbl. 

□ 

5 Dispersion of bipartite chord diagrams 

Definition 5.1. Recall that a chord diagram is a system of chords in a disc such that the 
intersecting point of any two chords is in the interior of the disc. We can consider the intersection 
graph of a chord diagram whose vertices are chords and two chords are connected when they 
intersect. If that graph is bipartite, the chord diagram is called bipartite. We shall always 
consider bipartite chord diagrams (BCD for short) with fixed subdivision into two parts T and 
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Q, so chords from one of the parts do not intersect. The intersection points of T- and Q-chords 
are the nodes of the BCD. 

For an integer K > 1, we fix a -fT-tuple a of numbers < a.\ < «2 < ••• < olk = 1- For every 
node o on a T-chord T and choice of the left-right direction on T, we assign the weight a/ where 
I is the minimum of K and the number of nodes on T to the left of o (including o). Thus the 
weight of a node o is smaller if it is closer to the boundary (according to the chosen direction 
on the T-chord containing o). Note that we assign two weights to each node o according to the 
two possible directions on a chord containing o. 

Let C be a Q-chord. Let (01,02) be a pair of nodes from C. Fixing an orientation from 
01 to 02 on C, we determine a left-to-right orientation on every T-chord crossing C. The node 
Oi (i = 1,2) lies on some T-chord %. Let o^ be the next node to the left of o, on % or the 
intersection point of % and dD if there are no nodes on % between Oj and dD to the left of Oj 
(according to the chosen direction of %). If either both o[ lie on the same Q-chord or both lie 
on dD, then we call the pair (01,02) good. Otherwise we call the pair (01,02) bad. By definition, 
the weight of the pair (01,02) is the product of their weights (corresponding to the direction 
from 01 to 02 on the Q-chord). We shall set the weight of every good pair to 0. Note that a pair 
(01,02) may be good while the pair (02,01) is bad. 




Figure 3: (01,02) is a bad pair, (02,01) is a good pair. 

The a- dispersion T> a (C) of the system of T-chords on a Q-chord C is the sum of weights of 
all bad pairs of nodes (01,02) such that 01 and 02 lie on C. For example, let 1 be the 1-tuple of 
numbers (1). Then V 1 (C)= I for Figure 3. 

Definition 5.2. The sum of dispersions T> a (C) over all Q-chords C is called the a-dispersion 
V a {G) of the BCD Q. 

Clearly V\{Q) is the number of bad pairs of Q, and T> a (Q) < V\{Q) for every a. 
We need a quadratic upper bound for the a-dispersion V of a BCD Q in terms of the number 
of T-chords in Q. 
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Figure 4: Lemma 5.3. The chords 71, %, are in Xk, T4 are in Xk+i- 



Lemma 5.3. Let r be the number of T-chords of a BCD Q on a disc D. Then the 1-dispersion 
'DiiG) of the BCD Q does not exceed r 2 — r. 

Proof. We will induct on the number of T-chords in Q. If the set T is empty, then T>i(Q) = 0, 
and the statement is obviously true. 

Thus, we may assume that there is a chord T in T. We can assume that T is close to the 
boundary that is there are no T-chords in one of the half-discs obtained by cutting D along 
T. We denote by C\, . . . ,Ci all the Q-chords intersected by T, where I > 0. We enumerate and 
orient the Q-chords so that C2 is to the right of Ci, C3 is to the right of C2, etc., and there are 
no nodes on Ck above Ok = CkHT, for each k = 1, I. 

Let Xk be the set of T-chords T of Q such that the pair (T n Ok) is bad. Denote by Lk 
the number of chords in Xj.. Thus is the number of bad pairs of the form (o, o&). 

Note that a chord T from Xk+i cannot intersect Ck because otherwise the pair (T' nCk,Ok) 
would be good. Hence the sets X^ are pairwise disjoint, and ^ < r — 1 because T does not 
belong to any Xk- 

Similarly let Rk be the number of bad pairs of the form (ok, o). Then as above, the sum of 
all Rk does not exceed r — 1. Hence 



Consider the BCD Q$ obtained from Q by deleting the T-chord T. Notice that every bad 
pair (c/, o") in Qq is also bad in Q and the difference T)\(Q) — T>\ (Go) between the number of bad 
pairs for Q and the number of bad pairs for Qq is the number of bad pairs of the forms (o,Ok) 
and (ofc,o) which does not exceed 2(r — 1) by (5.16). By the inductive assumption, £>i((?o) does 
not exceed (r — l) 2 — (r — 1). Hence D\(Q) does not exceed 




(5.16) 



(r - l) 2 - (r - 1) + 2(r - 1) = r 2 - r. 
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Figure 5: Lemma 5.5 



□ 

Remark 5.4. The estimate r 2 — r in Lemma 5.3 is optimal: for every r one can easily construct 
(using the proof of Lemma 5.3) a BCD with r T-chords whose 1-dispersion is exactly r 2 — r. 

Lemma 5.5. Let a BCD Qq be obtained by a deleting (a) a T-chord or (b) a Q-chord C of a 
BCD Q. Then the a-dispersion of Qq does not exceed the a-dispersion of Q . 

Proof, (a) If we remove a T-chord, then every bad pair in Qq is a bad pair in Q having the same 
weight. This implies part (a). 

(b) Suppose C is a Q-chord. Then the weight of any pair (01,02) in Qq cannot exceed the 
weight of that pair in Q. Indeed, if the neighbor o[ of o, L in Q did not belong to C, then the 
weight of that neighbor is the same or smaller in Qq (since that neighbor can only become closer 
to the boundary). If o[ belongs to C and its weight in Q is a/., then the weight of the neighbor 
of Oj in Q cannot be bigger than otk (because the neighbor of Oj in Qq is closer to the boundary 
than the neighbor of oi in Q). 

Let Xq be the set of bad pairs in Qq. Then Xq is the union of the set Y\ of all bad pairs of 
Q which do not belong to C (the weight of that pair in Qq does not exceed its weight in Q, we 
noted that in the previous paragraph) and the set Y2 of good pairs (01, 02) of Q whose neighbors 
o[,o' 2 belong to C but the pair (o^, o' 2 ) is bad in Q (the weight of (01,02) in Qq equals the weight 
of (o'i,o 2 ) in Q). Since (o'i,o 2 ) is uniquely determined by (01,02), the total weight of the pairs 
from Xq = Y\ U Yi in Qq does not exceed the total weight of bad pairs from C in Q. Hence 
D a (Qo) < V a {Q). □ 

Definition 5.6. We say that a Q-chord C is close to a Q-chord C of a BCD Q if every chord 
crossing C also crosses C. 

From now on let 

( l 2 u 
a = (-,-,..., 1). 

We shall write V(Q) instead of V a (Q) and call it simply dispersion of the grading Q. 

Lemma 5.7. Suppose that a Q-chord C is close to a Q-chord C in a BCD Q of a disc D. Let 
Dq be one of the two subdiscs of the disc D divided by C that contains C . Suppose that the 
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number of nodes from Dq on any T- chord in Q is at most K . Denote the numbers of nodes on 
C and C by I and V , respectively. Let Qq be the BCD obtained from Q by removing C . Then 

v(g )<v(g)-^i'(i-i>). (5.17) 

Proof. Let V\ be the set of nodes on C that belong to T-chords intersecting C , V2 be the set of 
other nodes on C. Then |Vi| = I', | V2 1 =1 — 1', and the number of pairs (0,0') of nodes on C 
such that C is to the left of C according to the direction 0^0' and either (o 6 V\ and d e V2) 
or [o e V 2 and d G V{), is - I'). 

Consider one of such pairs, (o, d). Let o = oq, o±, 02, • •• be a sequence of nodes from Dq on 
the T-chord T passing through o. Similarly consider a sequence of nodes d = o' Q , o'^o^— from 
Dq on a T-chord T . Since one of the nodes o or d is in V\ and another one in V2, there exists 
i = 0, 1, ... such that (oj, o£) belongs to some Q-chord C« / C and is bad. 

The weight in Qq of the node from the pair (oj, dj) that belongs to a T-chord that does not 
cross C, is the same as its weight in Q and is at least j^. The weight of the other node from that 
pair decreases by at least when we pass from Q to Qq since by the assumption the number 
of nodes on every T-chord in Dq is at most K and we removed C that was to the left of C{. 
Hence the weight of the pair decreases by at least Since the number of such pairs (oj, o£) is 

— I'), the total weight of such pairs decreases by at least — I'). 

Now as in Lemma 5.5 let Xq be the set of bad pairs in Qq. Then Xq is the union of the 
set Yi of all bad pairs of Q which do not belong to C and the set Y2 of good pairs (01,02) of Q 
whose neighbors o[, d 2 belong to C but the pair (o^, o' 2 ) is bad in Q (the weight of (01,02) in Qq 
equals the weight of (o^, o 2 ) in Q). As in the proof of Lemma 5.5 the weight of any pair of Y\ 
in Qq does not exceed its weight in Q. Since all pairs (oj, d { ) that we considered in the previous 
paragraph belong to Yi, we get the inequality (5.17). □ 

6 The upper bound of the Dehn function 

Lemma 3.3 implies that we can associate a bipartite chord diagram Q to any reduced diagram 
A, where T-chords are the medians of the maximal 0-bands, and Q-chords are the medians of 
g-bands. 

Remark 6.1. If A is not a topological disc then <9A can be transformed into a circle by an 
arbitrary small deformation, and so the topological structure of Q is well-defined. 

The dispersion of this BCD is called the dispersion T> = T>(A) of the diagram A. 

In the following lemma, we estimate the area of a van Kampen diagram over S o Z in terms 
of its perimeter and dispersion. Namely we show that for some constant M the area of any 
reduced diagram A of perimeter n does not exceed Mn 2 log n + MV(A) for some constant M. 
(Then using the quadratic upper bound for T>(A) we will deduce that the area is bounded 
by M'n 2 log n for some constant M' .) Roughly speaking, we are doing the following. We use 
induction on the perimeter of the diagram. First we remove rim 0-bands (those with one side 
on the boundary of the diagram) with short bases. This operation decreases the perimeter 
and preserves the sign of Mn 2 log n + MP(A) — area(A), so we can assume that the diagram 
does not have such bands. Then we use Lemma 4.9 and find a tight comb inside the diagram 
with a handle C. We also find a long enough (/-band C that is close to C. We use a surgery 
which amounts to removing a part of the diagram between C' and C and then gluing the two 
remaining parts of A together. The main difficulty is to show that, as a result of this surgery, the 
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perimeter decreases and the area and the dispersion change in such a way that the expression 
Mn 2 logn + M2?(A) — area(A) does not change its sign. In the proof, we need to consider several 
cases depending on the shape of the subdiagram between C and C. Note that neither Mn 2 logn 
nor MV(A) nor area(A) alone behave in the appropriate way as a result of the surgery, but the 
expression Mn 2 logn + MV(A) — area(A) behaves as needed. 

Let us take a big enough constant M. Here "big enough" means that M satisfies the 
inequalities used in the proof of Lemma 6.2 (i.e. (6.20), (6. 21), (6.22), (6.31), (6.35), (6.37), 
(6.38), (6.43)). Each of them has the form M > C for some constant C that does not depend on 
M (but depends on the constants introduced earlier), and the number of inequalities is finite, 
so the choice of M is possible. 

Lemma 6.2. The area of a reduced diagram A does not exceed Mra 2 log'n + MV(A) where 
n=\dA\, and log' n = max(log 2 n, 1). 

Proof. Arguing by contradiction, we consider a counter-example A with minimal perimeter n. 
Of course, its area is positive, and, by Lemma 3.3, we have at least 2 0-edges on the boundary 
<9A, and so n > 2. 

Step 1. Assume that there are two Q-chords C and C, where C is close to C in the BCD 
G(A), and suppose the number I' of the nodes on C' does not exceed a half of the number I of 
nodes lying on C. Let X and X' be the g-bands corresponding in A to C and C, respectively. 
Without loss of generality assume that the top subdiagram T with respect to X contains both 
X and X' and the top subdiagram T' < T with respect to X' does not contain X. We suppose 
in addition that V is a comb with handle X'. Let us prove that then the basic width b of the 
comb r' is not smaller than K, and in particular the comb cannot be tight. 

By contradiction suppose that b < K. 

We note that the bands X and X' contain / and /' cells, respectively. It follows from Lemma 
4.10 and our assumptions, that the area of T' does not exceed C\{1') 2 + 2al' for a = \T'\ a and 
some constant C\. 

Let A' be the diagram obtained by deleting the subdiagram V from A. Since the boundary 
of A' has at least two g-edges fewer than A, we have |<9A'| < |<9A| — 2. Moreover, we have from 
Lemma 4.6 and Lemma 3.3 that 

|3A| - |0A'| > 7 = max(2, 5{a - LI')) (6.18) 

because the top or the bottom of X' has at most LI' a-edges. 

The difference of the dispersions T>(A) — V(A') is at least — V) by Lemmas 5.7 and 

5.5. Hence T>(A) — V(A') > -^(l') 2 as I' < I — I'. This inequality and the inductive assumption 
related to the area of A', imply that the area of A' is not greater than 

M 

M{n - 7 ) 2 log'(n - 7 ) + MV(A) - ^(l') 2 . 

Adding the area of V, we see that the area of A does not exceed 

Mn 2 log'n + MV(A) - M 7 nlog'n - ^(l') 2 + Ci(l') 2 + 2a/'. 

K 

(Keep in mind that 7 < n.) This will contradict the choice of the counter-example A when we 
prove that 

-M 7 n log' n - ^{l'f + C\(l') 2 + 2al' < (6.19) 
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Consider two cases. 

(a) Let a < 2LI'. Then inequality (6.19) follows from the inequality 

M > 2K 2 (C\ + 4L). (6.20) 

(b) Assume that a > 2LI'. Then by (6.18) we have 7 > ^5a and Mjnlog' n > 2al' since 
n > 21' by Lemma 3.3, and 

M > 2<T 1 . (6.21) 

Since jl?(l') 2 > Ci(l') 2 by (6.20), the inequality (6.19) follows. 

Step 2. Assume that A has a rim #-band T whose base has s < K letters and top (7") is 
in <9(A). By deleting 7", we obtain, by Lemma 4.5, a diagram A' with |<9A'| < n — 5. Since 
top(T) lies on dA, we have from the definition of the length (Section 4.1), that the number 
of a-edges in top(T) is less than 5~ l (n — Ls). By Lemma 4.3, the length of T is at most 
(L + l)s + <5 -1 (n — Ls) < 5 _1 n since <5 _1 > by (4.14). Thus, by applying the inductive 
hypothesis to A', we have that area of A is not greater than M(n—5) 2 log'(n— 5)+MT)(A)+5~ 1 n 
because V(A') < V(A) by Lemma 5.5 (a). But this sum does not exceed Mn 2 log' n + MV(A) 
provided 

M > 5~ 2 . (6.22) 

This contradicts the choice of A. Hence the base of every rim #-band of A has more than 
K letters. 

Step 3. Now we can apply Lemma 4.9. By that lemma, there exists a tight comb T < A. 
Let T be a #-band of T with ,6-tight base. 

In particular, the basic width of T is smaller than K. Since the base of T is £>-tight, it is 
equal to uxvx for some x = Qi (we read the base starting at the boundary of A) where base(xvx) 
is ^-covered. 

The second occurrence of x in uxvx corresponds to the last cell of T belonging to the x- 
band Q. Let Q' be the maximal x-band of T crossing T at the cell corresponding to the first 
occurrence of x in uxvx. 

We consider the smallest subdiagram V of A containing all the #-bands of V crossing the 
x-band Q' . It is a comb with handle Q2 C Q. The comb V is covered by a trapezium T2 placed 
between Q' and Q, and a comb Ti with handle Q! ' . The Qj-band Q' belongs to both Y\ and 
T2. The remaining part of V is a disjoint union of two combs T3 and T4 whose handles Q3 
and Q4 contain the cells of Q that do not belong to the trapezium T2- The handle of T is the 
composition of handles Q3, Q2, Q4 of T3, T' and T4 in that order. 

Let the lengths of Q3 and Q4 be I3 and U, respectively. Let I' be the length of the handle of 

r. 

Then 

1 = 1' + l 3 + / 4 , 

and, as we proved in Step 1, I' > 1/2. 

For i G {3,4} and on = \dTi\ a , Lemma 4.10 gives inequalities 

Ai < dl 2 + 2aik (6.23) 

for some constant C\ where Ai is the area of Tj. Let ^3,^*4 be the top and the bottom of the 
trapezium T2. Here ps (resp. p±) share some initial edges with ^3 (with dT±), the rest of these 
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Figure 6: Step 3 in Lemma 6.2. 



paths belong to the boundary of A. We denote by d 3 the number of a-edges of p% and by d' 3 the 
number of its edges which do not belong to F 3 . Similarly, we introduce d 4 and d' A . Let A 2 be 
the area of T2- Then, by Lemma 4.4, 



for some constant Ci- 

Now we note that the handle Q2 of V is a copy of Q' because both maximal g-bands of the 
trapezium T2 correspond to the same basic letter x. This makes the following surgery possible. 
The diagram A is covered by two subdiagrams: V and another subdiagram Ai, having only 
the band Q in common. We construct a new auxiliary diagram by attaching T\ to Ai with 
identification of the band Q' of T\ and the band 0,2- We denote the constructed diagram by 
Ao- It is a reduced diagram because every pair of its cells having a common edge, has a copy 
either in Ti or in Ai. It follows from our constructions that the area of A does not exceed 
A2 + A3 + A4 + Ao, where Aq is the area of Ao- 

Let p 3 be the segment of the boundary ^3 that joins Q and T2 along the boundary of A. 
It follows from the definition of d 3 , d 3 , I3 and Q3, that the number of a-edges lying on p 3 is at 
least «3 — (g?3 — d' 3 ) — Ll 3 . 

Let U3 be the part of <9A that contains p 3 and connects Q with Q'. It has I3 6*-edges. Hence 
we have, by Lemma 4.6, that the length |u3| of U3 is at least 



Since U3 includes a subpath of length d' 3 having no 9-edges, we also have by inequality (4.15) 
that |u 3 | > k + 5(d' 3 - L). 

One can similarly define p and 1*4 for When passing from <9A to dA.o we replace the 
end edges of Q', U3 and U4 by two subpaths of dQ having lengths I3 and I4. Let no = |<9A|. 
Then it follows from the previous paragraph that 

n - no > 2 + <5(max(0, d 3 - L, a 3 - (d 3 - d 3 ) - 2Ll 3 ) + max(0, d' 4 - L,a 4 - (d A - d' 4 ) - 2L/ 4 )) (6.25) 
In particular, no < n — 2. By the inductive hypothesis, 



A 2 < C 2 l' (d 3 + d 4 + log I' + 1) 



(6.24) 



max(Z 3 ,Z 3 + 5(\p 3 



LI3)) > max(Z 3 ,/ 3 + 5{a 3 - {d 3 - d'3) - 2Ll 3 )). 



Ao < Mn { 



I log' n + MV(Aq) 



(6.26) 
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We note that the dispersion V(Aq) of Ao is not greater than V(A) — jrjl'il — V) by Lemmas 
5.7 and 5.5(b). 

Therefore, by inequality (6.26), the area of A is not greater than 

Mn 2 log' n + MV(A) - Mn(n - n ) log' n - - l') + A 2 + A 3 + A 4 (6.27) 

In view of inequalities (6.24) and (6.23), to obtain the desired contradiction, we should prove 
that 

Mn(n - n ) log' n + - I') > C 3 l'(d 3 + d 4 + log' l' + l) + C 3 (/| + 1\) + 2a 3 h + 2a 4 l A (6.28) 

where C 3 = max(Ci,C2) is a constant that does not depend on M. Note that we can assume 
that 

C 3 » L. (6.29) 
First we can choose M big enough so that -^V{1 - V) > C 3 (l 3 + Z 4 ) 2 > C 3 (lj + Indeed 



since /' > 1/2, and ^M^l'(l — I') > ^i{h + h)(h + h), so it is enough to assume that 



I - I' = l 3 + \ 4 < I' (6.30) 

3K 7V \ l ~ 1 > — 3 K^' 

M > 3K 2 C 3 . (6.31) 

We also have that 

y n(n - n ) log' n > C 3 /'(log' l' + 1) (6.32) 

because n — no > 2, n > 21' and M > 2C 3 by (6.31). 
It remains to prove that 

Y«(n - n ) log' n + |^ - I 1 ) > C 3 l'(d 3 + d 4 ) + 2a 3 l 3 + 2a 4 l 4 . (6.33) 

We assume without loss of generality that a 3 > a 4 , and consider two cases, 
(a) Suppose a 3 < 2C 3 (l - I'). 

Since di < a, L + d\ for i = 3,4, we also, by inequality (6.25), have 

d 3 + d 4 < a 3 + q 4 + d' 3 + d' 4 < 4C 3 (Z - /') + <5 _1 (n - n ) + 2L - 25 -1 < 4C 3 (Z - /') + 5~ l {n - n ). 

since e)" 1 > L by (4.14). 
Therefore 

- /') + Y n ( n ~ ^o)log'n > 4Cf/'(Z - + Ca^V - n )l' > C 3 l'(d 3 + d 4 ) (6.34) 
since we can assume that 

M > 12K 2 Cl M/2 > C 3 5- 1 . (6.35) 

We have also by (6.30): 

i( I' (I ~ I') > ^(k + h)(h + h) > ^-^j^ih + h) > 2a 3 l 3 + 2a 4 l 4 (6.36) 



3K 2 v ; - m 2K n ' - 3 4C 3 
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since we can assume that 

M > 2AK 2 C 3 . (6.37) 

The sum of inequalities (6.34) and (6.36) gives us the desired inequality (6.33). 

(b) Assume now that a 3 > 2C$(l — I'). Then, applying Lemma 4.10 to the comb I^, we 
obtain d 3 — d' 3 < \a 3 + 2CKl 3 < |q 3 since l 3 < I — I' < ^ and 

2C 3 > V2CK. (6.38) 

We also have d 4 — d' 4 < \a 4 + 2CKl 4 < ^a 3 . These two inequalities and inequality 6.25 lead to 

d 3 + d 4 < ^a 3 + 5' 1 (n - n ) (6.39) 

In addition, 

1 2L 1 

a 3 - (d 3 - d' 3 ) - 2Ll 3 > -a 3 - 7-^-03 > -z®z, 

ZO3 1 

since l 3 < I - I' < ^ and C 3 > 42L by (6.29). Therefore, by 6.25, 

n — no > -5a 3 . (6.40) 

Thus by (6.39) 

d 3 + d 4 < 13<5 _1 (n-no). (6.41) 
Since 21' < n and n — no > 2, inequality (6.41) implies 

— n(n - n ) log' n > C 3 l'{d 3 + d 4 ) (6.42) 
o 

because we can assume that 

M » CgtT 1 (6.43) 

(M >21C 3 5~ 1 is enough). 

Inequalities (6.40), (6.43), a 3 > and 4(/3 + I4) < n give us 

^n(n - n ) logn' > lc 3 5~ l (n - n )n > 2a 3 (l 3 + / 4 ) > 2a 3 / 3 + 2a 4 / 4 (6.44) 

D Z 

The inequality (6.33) follows now from inequalities (6.42), and (6.44). □ 

Lemma 6.3. Let n be the combinatorial length of a reduced diagram A. Then the area of A is 
0(n 2 logn). 

Proof. By Lemmas 3.3 and 5.3, we have 

V{A) < (n/2) 2 . (6.45) 

Let n' = \A\. It follows from the definition of the length that n' < n. By this inequality, 
inequality (6.45) and Lemma 6.2, we have the inequality area(A) < Mn 2 log' n + ^-n 2 . The 
lemma is proved. □ 
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7 The end of the proof 



Lemma 7.1. The Dehn function of S o Z is, up to equivalence, at least n 2 logn. 

Proof. Let us use the fact that for some fixed word W, S had the computation (3.6) for every 
n. The length of that computation is 2n and the width is n + N. 

Consider the corresponding computation of S o Z (see the proof of Lemma 4.2). Its width is 
at most n + 2N. Let l(n) be the length of that computation and let a(n) be its area. Then by 
Remark 3.19 and the description of the computation of S o Z (see Section 3.7), 

n— 1 n— 1 

2 J^(2 m +2iV-3) + (2 n +2iV-3)+2iV-l < l(n) < 2 ^ (6-2 m +2iV-3) + (6-2 n +2iV-3)+2iV-l. 

m=l m=l 

So 3 • 2™ + Ci < Z(n) < 18 • 2 n + C 2 for some constants Ci, C 2 . In addition a(n) > C'l{n) log 2 Z(n) 
for some constant C 

The sides of the corresponding trapezium A have the same labels. Thus we can take l(n) 
copies of A and glue them side by side to obtain a new trapezium A. The perimeter d of A is 
0{l{n)) and the area is 0(l(n) 2 log l(n)). By Lemmas 3.3 and 3.11, there is only one reduced 
van Kampen diagram with the same boundary label as A. 

Since l{n) is between 3 • 2™ + C\ and 18 • 2™ + C 2 , for every sufficiently large number al there 
exists a number of the form l(n) between d and 15d. Indeed it is enough to find a natural number 
n satisfying 2 n > (d — Ci)/3 and 2™ < (15<i — C 2 )/18. Such a number n exists for all sufficiently 
large d since 15/18 > 2/3. 

Hence n 2 log n is equivalent to a lower bound for the Dehn function of S o Z. □ 

Lemmas 7.1 and 6.2 show that the Dehn function of the group SoZ is equivalent to n 2 logra. 
Finally the undecidability part of Theorem 1.1 follows from Lemmas 3.15 and 3.16. Indeed, 
by these lemmas if £ is a non-recursive language then SoZ has undecidable conjugacy problem. 

Remark 7.2. An easy example of a group with Dehn function n 2 logn constructed using the 
method described in this paper is the group Sq o Z where Sq has only one gdetter {k}, one 
a-letter a, and two rules ti, r 2 , both equalling [k — > ak] (and their inverses). One can easily 
write down an explicit presentation of this group. It is an HNN extension of a free group of rank 
10 with 14 free letters. 

Remark 7.3. The method used in this paper allows one to construct groups with other small 
Dehn functions. In fact, a similar argument to the one used in the proof of Lemma 6.2 proves 
the following statement. 

Let f(n) > 1 be any function. Let S be an S-machine satisfying the condition of Lemma 3.31 
with log 2 h replaced by f(h). Then the Dehn function of the group S does not exceed n 2 f(n). 

Using the technique from [11] of converging Turing machines into 5-machines, and Remark 
7.3, one can construct S*-machines with many other Dehn functions between n 2 and n 3 (the fact 
that the Dehn function of any S'-machine does not exceed n 3 has been proved in [11]). 
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